NVIDIA
Explore
Models
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

View All Playbooks
View All Playbooks

onboarding

  • Set Up Local Network Access
  • Open WebUI with Ollama

data-science

  • Optimized JAX
  • Text to Knowledge Graph

tools

  • Comfy UI
  • DGX Dashboard
  • VS Code
  • RAG application in AI Workbench
  • Set up Tailscale on your Spark

fine-tuning

  • FLUX.1 Dreambooth LoRA Fine-tuning
  • LLaMA Factory
  • Fine-tune with NeMo
  • Fine tune with Pytorch
  • Unsloth on DGX Spark
  • Vision-Language Model Fine-tuning

use-case

  • Build and Deploy a Multi-Agent Chatbot
  • NCCL for Two Sparks
  • Connect Two Sparks
  • Video Search and Summarization

inference

  • Multi-modal Inference
  • NIM on Spark
  • NVFP4 Quantization
  • Speculative Decoding
  • TRT LLM for Inference
  • Install and Use vLLM for Inference

Open WebUI with Ollama

15 MIN

Install Open WebUI and use Ollama to chat with models on your Spark

Basic idea

Open WebUI is an extensible, self-hosted AI interface that operates entirely offline. This playbook shows you how to deploy Open WebUI with an integrated Ollama server on your DGX Spark device using NVIDIA Sync. The setup creates a secure SSH tunnel that lets you access the web interface from your local browser while the models run on Spark's GPU.

What you'll accomplish

You will have a fully functional Open WebUI installation running on your DGX Spark, accessible through your local web browser via NVIDIA Sync's managed SSH tunneling. The setup includes integrated Ollama for model management, persistent data storage, and GPU acceleration for model inference.

What to know before starting

  • How to use NVIDIA Sync to connect to your DGX Spark device

Prerequisites

  • DGX Spark device is set up and accessible
  • NVIDIA Sync installed and connected to your DGX Spark
  • Enough disk space for the Open WebUI container image and model downloads

Time & risk

  • Duration: 15-20 minutes for initial setup, plus model download time (varies by model size)
  • Risks:
    • Docker permission issues may require user group changes and session restart
    • Large model downloads may take significant time depending on network speed

Resources

  • Open WebUI Documentation
  • DGX Spark Documentation
  • DGX Spark Developer Forum