Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
84 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
Downloadable
nemotron-3-nano-omni-30b-a3b-reasoning
Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.
Image-to-Text
+4
Items per page
24
1
1
2
2
3
3
4
4
of 4 pages
2.37M
1w
NVIDIA
Downloadable
NVIDIA AI for Media Relighting
Re-illuminate people in video to match target lighting from a 360 HDRI environment map.
HDRI
+3
423
2w
NVIDIA
Free Endpoint
nemotron-3-content-safety
Multilingual, multimodal model for detecting unsafe and toxic content.
llm safety
+3
45.87K
2w
NVIDIA
Downloadable
Free Endpoint
synthetic-video-detector
NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI‑generated (synthetic) videos.
broadcast
+4
42.99K
2w
NVIDIA
Downloadable
Free Endpoint
Active Speaker Detection
Detect and track speaker identities across video frames.
localization
+4
1.17K
2w
NVIDIA
Downloadable
LipSync
Generative lip dubbing that syncs lips in a video to input audio.
lipsync
+6
2w
NVIDIA
Downloadable
ising-calibration-1-35b-a3b
Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
Quantum
+3
182K
3w
NVIDIA
Downloadable
llama-nemotron-rerank-vl-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
62.19K
1mo
NVIDIA
Free Endpoint
nemotron-voicechat
Nemotron 3 Voicechat
English
+2
2.6K
1mo
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Automatic Speech Recognition
+2
19.78K
1mo
NVIDIA
Downloadable
nemotron-ocr-v1
Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Table Extraction
+4
925K
1mo
NVIDIA
Downloadable
nemotron-3-super-120b-a12b
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
MoE
+4
45.79M
1mo
NVIDIA
Downloadable
llama-nemotron-rerank-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
181K
2mo
NVIDIA
Downloadable
nemotron-table-structure-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
19.14K
2mo
NVIDIA
Downloadable
nemotron-page-elements-v3
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+4
66.55K
2mo
NVIDIA
Downloadable
nemotron-graphic-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
19.85K
2mo
NVIDIA
Downloadable
llama-nemotron-embed-1b-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
28.38M
2mo
NVIDIA
Free Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
PII Detection
+1
154K
2mo
NVIDIA
Free Endpoint
cosmos-transfer2.5-2b
Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Synthetic Data Generation
+4
2mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
nemo retriever
+3
6.65M
2mo
NVIDIA
Free Endpoint
nemotron-content-safety-reasoning-4b
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
NeMo Guardrails
+3
241K
3mo
NVIDIA
Downloadable
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
video understanding
+8
381K
4mo
NVIDIA
Downloadable
nemoretriever-page-elements-v3
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
12.84K
4mo
NVIDIA
Downloadable
nemotron-3-nano-30b-a3b
Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
MoE
+3
9.77M
4mo