Skip to main content
Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
56 results for
Filters
Models (30)
Blueprints (6)
Other (20)
Sort By
score:DESC
Best Match
DGX Spark
20 MIN
Live VLM WebUI
Real-time Vision Language Model interaction with webcam streaming
Playbook
Vision AI
+4
4mo
Items per page
24
1
1
2
2
3
3
of 3 pages
DGX Spark
1 HR
Vision-Language Model Fine-tuning
Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3
Playbook
DGX
+6
7mo
General
Launchable
Developer Example
LLM Router
Route LLM requests to the best model for the task at hand.
Blueprint
NVIDIA AI
+1
3mo
Z.ai
Downloadable
glm-5.1
GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
Model
Agentic AI
+3
26.16M
1mo
Qwen
Downloadable
qwen3.5-397b-a17b
Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
Model
MoE
+3
12.01M
3mo
NVIDIA
Downloadable
nemotron-3-nano-omni-30b-a3b-reasoning
Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.
Model
Image-to-Text
+4
9.57M
1mo
Google
Free Endpoint
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Model
image
+8
15.84K
1y
DGX Spark
30 MIN
vLLM for Inference
Install and use vLLM on DGX Spark
Playbook
DGX
+1
2mo
DGX Spark
1 HR
TRT LLM for Inference
Install and use TensorRT-LLM on DGX Spark
Playbook
DGX
+1
7mo
DGX Station
20 MIN
Serve Qwen3-235B with vLLM
Set up vLLM server with Qwen3-235B on DGX Station
Playbook
vLLM
+1
2mo
DGX Station
30 MIN
LLM Inference with SGLang
Serve LLMs with SGLang on DGX Station (Qwen3-8B default; Qwen3.6 MoE optional)—prefix-cached multi-turn, structured output, benchmarks, and inference-server guidance
Playbook
RadixAttention
+6
1d
DGX Spark
30 MIN
LM Studio on DGX Spark
Deploy LM Studio and serve LLMs on a Spark device; use LM Link to access models remotely.
Playbook
Inference
+3
3mo
NVIDIA
Downloadable
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
Model
language generation
+3
2.89M
7mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text/img and creates informative responses
Model
doc intelligence
+2
8.58M
11mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
7.15M
3mo
NVIDIA
Downloadable
llama-nemotron-rerank-vl-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
Model
nemo retriever
+2
120K
1mo
DGX Station
30 MINS
NemoClaw with Nemotron-3-Super and vLLM on DGX Station
Install NemoClaw on DGX Station with local vLLM inference and Telegram bot integration
Playbook
vLLM
+8
1mo
DGX Spark
30 MIN
NIM on Spark
Deploy a NIM on Spark
Playbook
DGX
+1
7mo
Mistral AI
Downloadable
ministral-14b-instruct-2512
A general purpose VLM ideal for chat and instruction based use cases
Model
language generation
+3
3.09M
5mo
DGX Spark
30 MIN
Nemotron-3-Nano with llama.cpp
Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark
Playbook
Nemotron
+3
5mo
DGX Spark
30 MIN
Run Hermes Agent with Local Models
Install and run the Hermes self-improving AI agent on DGX Spark.
Playbook
Nous Research
+2
2w
DGX Spark
30 MIN
Run models with llama.cpp on DGX Spark
Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
Playbook
DGX Spark
+3
1mo
NVIDIA
Downloadable
nemoguard-jailbreak-detect
Industry leading jailbreak classification model for protection from adversarial attempts
Model
nemo guardrails
+6
12.45K
11mo
NVIDIA
Downloadable
ising-calibration-1-35b-a3b
Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
Model
Quantum
+3
352K
1mo