NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
DiscoverModelsBlueprintsGPUsDocsForums

workstations

  • Run on RTX
  • Run on Spark
  • Run on Station

models

  • Reasoning
  • Vision
  • Visual Design
  • Retrieval
  • Speech
  • Biology
  • Simulation
  • Climate & Weather
  • Safety & Moderation

industries

  • Automotive
  • Financial Services
  • Gaming
  • Healthcare
  • Industrial
  • Robotics

Reasoning

Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for development
Accelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes

Fresh Off the Press

The latest innovations in intelligence models

Moonshotai
Downloadable

kimi-k2.6

1T multimodal MoE for long-horizon coding, agentic tool use, and image/video understanding.
Mixture-of-Experts
2w
Qwen
Downloadable

qwen-image

Qwen-Image is a text-to-image foundation model with advanced multilingual text rendering.
2w
Qwen
Downloadable

qwen-image-edit

Qwen-Image-Edit is an image editing model with multilingual text editing and strong subject consistency.
2w
Mistral AI
Downloadable

mistral-medium-3.5-128b

A high performing model for text generation, coding and agentic use cases
agentic
2w
NVIDIA
Downloadable

nemotron-3-nano-omni-30b-a3b-reasoning

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.
OCR
2w
DeepSeek AI
Downloadable

deepseek-v4-flash

DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.
MoE
3w
DeepSeek AI
Downloadable

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.
Moe
3w
Z.ai
Downloadable

glm-5.1

GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
Agentic AI
3w
Minimaxai
Free Endpoint

minimax-m2.7

MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
coding
1mo
NVIDIA
Free Endpoint

nemotron-3-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.
ai safety nemo guardrails
4w

Developer Favorites

The top large language models for your enterprise AI

DeepSeek AI
Downloadable

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.
Moe
3w
NVIDIA
Downloadable

nemotron-3-nano-omni-30b-a3b-reasoning

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.
OCR
2w
Z.ai
Downloadable

glm-5.1

GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
Agentic AI
3w
Google
Downloadable

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
agentic
1mo
OpenAI
Downloadable

gpt-oss-20b

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
chat
9mo
NVIDIA
Downloadable

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
Instruction Following
5mo
NVIDIA
Downloadable

cosmos-reason2-8b

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Physical AI
4mo