Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
77 results for
Filters
Models (56)
Blueprints (7)
Other (14)
Sort By
score:DESC
Best Match
DGX Spark
20 MIN
Live VLM WebUI
Real-time Vision Language Model interaction with webcam streaming
Playbook
Vision AI
+4
3mo
NVIDIA
Downloadable
llama-3.2-nemoretriever-1b-vlm-embed-v1
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
203K
9mo
DGX Spark
1 HR
Vision-Language Model Fine-tuning
Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3
Playbook
DGX
+6
6mo
NVIDIA
Launchable
LLM Router
Route LLM requests to the best model for the task at hand.
Blueprint
NVIDIA AI
+1
1mo
Qwen
Downloadable
qwen3.5-397b-a17b
Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
Model
chat
+4
13.84M
1mo
Google
Deprecation in 2d
Free Endpoint
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Model
image
+8
41.97K
1y
NVIDIA
Deprecation in 2d
Free Endpoint
retail-object-detection
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
Model
Object Detection
+8
480
1y
NVIDIA
Deprecation in 2d
Free Endpoint
visual-changenet
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
Model
image
+8
622
1y
DGX Spark
30 MIN
vLLM for Inference
Install and use vLLM on DGX Spark
Playbook
DGX
+2
1mo
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Model
Tool Calling
+4
15.01M
2mo
Z.ai
Deprecation in 7d
Downloadable
glm-5
GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.
Model
MoE
+3
39.55M
1mo
DGX Spark
1 HR
TRT LLM for Inference
Install and use TensorRT-LLM on DGX Spark
Playbook
DGX
+2
6mo
NVIDIA
Multi-LLM NIM
Use the multi-LLM compatible NIM container to deploy a broad range of LLMs from Hugging Face.
Blueprint
nim
1mo
DGX Station
20 MIN
Serve Qwen3-235B with vLLM
Set up vLLM server with Qwen3-235B on DGX Station
Playbook
Station
+2
1mo
NVIDIA
Deprecation in 2d
Free Endpoint
ocdrnet
OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.
Model
Optical Character Recognition
+8
1.05K
1y
DGX Spark
LM Studio on DGX Spark
Deploy LM Studio and serve LLMs on a Spark device; use LM Link to access models remotely.
Playbook
Inference
+4
2mo
NVIDIA
Downloadable
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
Model
chat
+4
4.48M
5mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text/img and creates informative responses
Model
chat
+3
8.91M
9mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
8.02M
2mo
NVIDIA
Downloadable
llama-nemotron-rerank-vl-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
Model
nemo retriever
+2
522
1w
DGX Spark
30 MIN
NIM on Spark
Deploy a NIM on Spark
Playbook
DGX
+1
6mo
Mistral AI
Downloadable
ministral-14b-instruct-2512
A general purpose VLM ideal for chat and instruction based use cases
Model
chat
+4
1.35M
4mo
DGX Spark
30 MIN
Nemotron-3-Nano with llama.cpp
Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark
Playbook
Nemotron
+3
3mo
NVIDIA
Launchable
Ambient Healthcare Agents
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
Blueprint
NVIDIA AI
+3
1mo
Items per page
24
1
1
2
2
3
3
4
4
of 4 pages