AI augmented photo booth using the DGX Spark and Reachy Mini.
Spark & Reachy Photo Booth is an interactive and event-driven photo booth demo that combines the DGX Spark™ with the Reachy Mini robot to create an engaging multimodal AI experience. The system showcases:
NeMo Agent Toolkitopenai/gpt-oss-20b LLM powered by TensorRT-LLMnvidia/riva-parakeet-ctc-1.1B and hexgrad/Kokoro-82Mblack-forest-labs/FLUX.1-Kontext-dev for image-to-image restylingfacebookresearch/detectron2 and FoundationVision/ByteTrackThe demo is based on a several services that communicate through a message bus.
NOTE
This playbook applies to both the Reachy Mini and Reachy Mini Lite robots. For simplicity, we’ll refer to the robot as Reachy throughout this playbook.
You'll deploy a complete photo booth system on DGX Spark running multiple inference models locally — LLM, image generation, speech recognition, speech generation, and computer vision — all without cloud dependencies. The Reachy robot interacts with users through natural conversation, captures photos, and generates custom images based on prompts, demonstrating real-time multimodal AI processing on edge hardware.
Hardware Requirements:
TIP
Make sure your Reachy robot firmware is up to date. You can find instructions to update it here. Software Requirements:
NVIDIA_API_KEY). Create a key if necessary. Make sure to enable the NGC Catalog scope when creating the key.HF_TOKEN). Create a token if necessary. Make sure to create a token with Read access to contents of all public gated repos you can access permission.All required assets can be found in the Spark & Reachy Photo Booth repository.
Your use of the Spark Playbook scripts is governed by Apache License, Version 2.0 and enables use of separate open source and proprietary software governed by their respective licenses: Flux.1-Kontext NIM, Parakeet 1.1b CTC en-US ASR NIM, TensorRT-LLM, minio/minio, arizephoenix/phoenix, grafana/otel-lgtm, Python, Node.js, nginx, busybox, UV Python Packager, Redpanda, Redpanda Console, gpt-oss-20b, FLUX.1-Kontext-dev, FLUX.1-Kontext-dev-onnx.
NOTE
FLUX.1-Kontext-dev and FLUX.1-Kontext-dev-onnx are models released for non-commercial use. Contact sales@blackforestlabs.ai for commercial terms. You are responsible for accepting the applicable License Agreements and Acceptable Use Policies, and for ensuring your HF token has the correct permissions.