Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
217 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
Downloadable
nemotron-ocr-v1
Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Table Extraction
+4
Today
NVIDIA
Downloadable
nemotron-3-super-120b-a12b
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
chat
+5
329K
1d
NVIDIA
Downloadable
llama-nemotron-rerank-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
4.35K
6d
Qwen
API Endpoint
qwen3.5-122b-a10b
122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
chat
+4
1.49M
1w
NVIDIA
Downloadable
nemotron-table-structure-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
6.07K
1w
NVIDIA
Downloadable
nemotron-page-elements-v3
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+4
8.29K
1w
NVIDIA
Downloadable
nemotron-graphic-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
5.8K
1w
NVIDIA
Downloadable
llama-nemotron-embed-1b-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
366K
1w
NVIDIA
API Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
PII Detection
+1
145K
1w
Minimaxai
Downloadable
minimax-m2.5
MiniMax M2.5 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
coding
+3
4.19M
2w
NVIDIA
API Endpoint
cosmos-transfer2.5-2b
Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Synthetic Data Generation
+4
2w
Qwen
Downloadable
qwen3.5-397b-a17b
Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
chat
+4
8.02M
3w
Z.ai
Downloadable
glm5
GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.
chat
+3
9.8M
3w
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
nemo retriever
+3
883K
1mo
Minimaxai
API Endpoint
minimax-m2.1
MiniMax M2.1 excels in multi-language coding, app/web dev, office AI, and agent integration
chat
+3
8.33M
1mo
Stepfun-ai
API Endpoint
step-3.5-flash
200B open-source reasoning engine with sparse MoE powering frontier agentic AI.
chat
+3
7.8M
1mo
Moonshotai
Downloadable
kimi-k2.5
1T multimodal MoE for high‑capacity video and image understanding with efficient inference.
chat
+4
22.84M
1mo
Z.ai
API Endpoint
glm4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Tool Calling
+4
17.73M
1mo
NVIDIA
API Endpoint
nemotron-content-safety-reasoning-4b
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
NeMo Guardrails
+3
569K
1mo
NVIDIA
Downloadable
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
video understanding
+8
237K
2mo
NVIDIA
Downloadable
nemoretriever-page-elements-v3
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
724K
2mo
DeepSeek AI
API Endpoint
deepseek-v3.2
State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
chat
+3
16.35M
2mo
NVIDIA
Downloadable
nemotron-3-nano-30b-a3b
Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
chat
+4
12.23M
2mo
NVIDIA
API Endpoint
riva-translate-4b-instruct-v1_1
Translation model in 12 languages with few-shots example prompts capability.
nvidia nim
+2
567K
3mo
Items per page
24
1
1
2
2
3
3
4
4
5
5
...
10
10
of 10 pages