Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Deploy Models Now with NVIDIA NIM
Optimized inference for the world’s leading models
Get API Key
Free serverless APIs for development
Accelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes
Discover
Models
Blueprints
GPUs
Docs
Forums
workstations
Run on RTX
Run on Spark
models
Reasoning
Vision
Visual Design
Retrieval
Speech
Biology
Simulation
Climate & Weather
Safety & Moderation
industries
Automotive
Financial Services
Gaming
Healthcare
Industrial
Robotics
Reasoning
Developer Favorites
The top large language models for your enterprise AI
OpenAI
Downloadable
gpt-oss-20b
Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
Model
chat
+3
7mo
NVIDIA
Downloadable
nemotron-3-nano-30b-a3b
Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
Model
Instruction Following
+3
3mo
NVIDIA
Downloadable
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Model
Physical AI
+7
3mo
DeepSeek AI
Free Endpoint
deepseek-v3.1
DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.
Model
Reasoning
+1
7mo
Fresh Off the Press
The latest innovations in intelligence models
Moonshotai
Downloadable
kimi-k2.5
1T multimodal MoE for high‑capacity video and image understanding with efficient inference.
Mixture-of-Experts
+2
2mo
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Coding
+3
2mo
NVIDIA
Free Endpoint
nemotron-content-safety-reasoning-4b
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
NeMo Guardrails
+3
2mo
NVIDIA
Downloadable
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Physical AI
+7
3mo
DeepSeek AI
Free Endpoint
deepseek-v3.2
State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
long context
+2
3mo
NVIDIA
Downloadable
nemotron-3-nano-30b-a3b
Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
Instruction Following
+3
3mo
Mistral AI
Free Endpoint
devstral-2-123b-instruct-2512
State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
agentic
+3
3mo