Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
53 results for
Filters
Models (29)
Blueprints (7)
Other (17)
Sort By
score:DESC
Best Match
NVIDIA
Launchable
LLM Router
Route LLM requests to the best model for the task at hand.
Blueprint
NVIDIA AI
+1
2mo
Items per page
24
1
1
2
2
3
3
of 3 pages
DGX Spark
1 HR
TRT LLM for Inference
Install and use TensorRT-LLM on DGX Spark
Playbook
DGX
+1
6mo
NVIDIA
Multi-LLM NIM
Use the multi-LLM compatible NIM container to deploy a broad range of LLMs from Hugging Face.
Blueprint
nim
2mo
DGX Spark
30 MIN
NIM on Spark
Deploy a NIM on Spark
Playbook
DGX
+1
6mo
DGX Spark
30 MIN
Nemotron-3-Nano with llama.cpp
Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark
Playbook
Nemotron
+3
4mo
NVIDIA
Launchable
Ambient Healthcare Agents
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
Blueprint
NVIDIA AI
+3
2mo
DGX Spark
30 MIN
Run models with llama.cpp on DGX Spark
Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
Playbook
DGX Spark
+3
3w
NVIDIA
Downloadable
nemoguard-jailbreak-detect
Industry leading jailbreak classification model for protection from adversarial attempts
Model
nemo guardrails
+6
34.18K
10mo
NVIDIA
Launchable
Biomedical AI-Q Research Agent Blueprint
Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint
Blueprint
Retrieval-augmented generation
+1
2mo
DGX Spark
20 MINS
CLI Coding Agent
Build local CLI coding agents with Ollama
Playbook
Coding
+6
2d
DGX Station
30 MINS
Local Coding Agent
Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)
Playbook
Coding
+5
1mo
DGX Spark
OpenClaw 🦞
Run OpenClaw locally on DGX Spark with LM Studio or Ollama
Playbook
DGX
+3
1mo
Z.ai
Downloadable
glm-5.1
GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
Model
Agentic AI
+3
2.53M
1w
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Model
Tool Calling
+3
4.57M
1w
DGX Spark
30 MIN
vLLM for Inference
Install and use vLLM on DGX Spark
Playbook
DGX
+1
1mo
DGX Spark
20 MIN
Live VLM WebUI
Real-time Vision Language Model interaction with webcam streaming
Playbook
Vision AI
+4
3mo
DGX Station
20 MIN
Serve Qwen3-235B with vLLM
Set up vLLM server with Qwen3-235B on DGX Station
Playbook
vLLM
+1
1mo
DGX Station
30 MIN
Nanochat Training
Train a small ChatGPT-style LLM (nanochat) with tokenizer, pretraining, midtraining, and SFT on DGX Station with GB300 Ultra
Playbook
Training
+6
1mo
NVIDIA
Downloadable
llama-3.1-nemoguard-8b-content-safety
Leading content safety model for enhancing the safety and moderation capabilities of LLMs
Model
nemo guardrails
+4
126K
1y
NVIDIA
Free Endpoint
llama-3.1-nemotron-safety-guard-8b-v3
Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
Model
content moderation
+4
112K
6mo
NVIDIA
Launchable
Build an AI Virtual Assistant
Create intelligent virtual assistants for customer service across every industry
Blueprint
NVIDIA AI
+4
2mo
NVIDIA
Downloadable
llama-3.1-nemoguard-8b-topic-control
Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
Model
nemo guardrails
+4
124K
1y
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.
Model
LLM Multimodal Safety
+3
187K
10mo
NVIDIA
Free Endpoint
nemotron-3-content-safety
Multilingual, multimodal model for detecting unsafe and toxic content.
Model
llm safety
+3
23.18K
1w