57 results for
Filters
Models (33)
Blueprints (7)
Other (17)
DGX Spark
20 MIN
Live VLM WebUI
Real-time Vision Language Model interaction with webcam streaming
Playbook
Vision AI
+4
3mo
DGX Spark
1 HR
Vision-Language Model Fine-tuning
Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3
Playbook
DGX
+6
6mo
NVIDIA
Free Endpoint
nemotron-3-nano-omni-30b-a3b-reasoning
Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, and text.
Model
VLM
+4
Today
NVIDIA
Launchable
LLM Router
Route LLM requests to the best model for the task at hand.
Blueprint
NVIDIA AI
+1
2mo
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Model
Tool Calling
+3
4.57M
1w
Z.ai
Downloadable
glm-5.1
GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
Model
Agentic AI
+3
2.53M
1w
Qwen
Downloadable
qwen3.5-397b-a17b
Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
Model
MoE
+3
9.6M
2mo
Google
Free Endpoint
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Model
image
+8
28.56K
1y
DGX Spark
30 MIN
vLLM for Inference
Install and use vLLM on DGX Spark
Playbook
DGX
+1
1mo
DGX Spark
1 HR
TRT LLM for Inference
Install and use TensorRT-LLM on DGX Spark
Playbook
DGX
+1
6mo
NVIDIA
Multi-LLM NIM
Use the multi-LLM compatible NIM container to deploy a broad range of LLMs from Hugging Face.
Blueprint
nim
2mo
DGX Station
20 MIN
Serve Qwen3-235B with vLLM
Set up vLLM server with Qwen3-235B on DGX Station
Playbook
vLLM
+1
1mo
DGX Spark
30 MIN
LM Studio on DGX Spark
Deploy LM Studio and serve LLMs on a Spark device; use LM Link to access models remotely.
Playbook
Inference
+3
2mo
NVIDIA
Downloadable
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
Model
language generation
+3
4.63M
6mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text and images and creates informative responses
Model
doc intelligence
+2
7.32M
10mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
7.17M
2mo
NVIDIA
Downloadable
llama-nemotron-rerank-vl-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
Model
nemo retriever
+2
7.09K
4w
DGX Station
30 MIN
NemoClaw with Nemotron-3-Super and vLLM on DGX Station
Install NemoClaw on DGX Station with local vLLM inference and Telegram bot integration
Playbook
vLLM
+8
Today
DGX Spark
30 MIN
NIM on Spark
Deploy a NIM on Spark
Playbook
DGX
+1
6mo
Mistral AI
Downloadable
ministral-14b-instruct-2512
A general-purpose VLM ideal for chat and instruction-based use cases
Model
language generation
+3
1.6M
4mo
DGX Spark
30 MIN
Nemotron-3-Nano with llama.cpp
Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark
Playbook
Nemotron
+3
4mo
NVIDIA
Launchable
Ambient Healthcare Agents
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
Blueprint
NVIDIA AI
+3
2mo
DGX Spark
30 MIN
Run models with llama.cpp on DGX Spark
Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
Playbook
DGX Spark
+3
3w
NVIDIA
Downloadable
nemoguard-jailbreak-detect
Industry-leading jailbreak classification model for protection from adversarial attempts
Model
nemo guardrails
+6
34.18K
10mo