Skip to main content
Explore
Models
Skills
Blueprints
GPUs
Docs
Search
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
77 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
Downloadable
qwen-image-edit-nvpcb-ovsl2sl
An image edit model specialized for Omniverse synthetic to photographic solder-light style captured at NVIDIA PCB inspection stations
Synthetic Data Generation
+2
1d
Items per page
24
1
1
2
2
3
3
4
4
of 4 pages
NVIDIA
Downloadable
nemotron-ocr-v2
Nemotron OCR v2 is a state-of-the-art multilingual text recognition model designed for robust end-to-end optical character recognition (OCR) on complex real-world images.
Table Extraction
+4
295K
10d
NVIDIA
Downloadable
Free Endpoint
nemotron-3-ultra-550b-a55b
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
Agent
+4
8M
1mo
NVIDIA
Downloadable
Free Endpoint
nemotron-3.5-content-safety
Multilingual, multimodal model for detecting unsafe and toxic content.
llm safety
+3
2M
1mo
NVIDIA
Free Endpoint
cosmos3-nano
Generates physics-aware videos from text prompts or an image prompt for physical AI development.
autonomous vehicles
+5
2K
1mo
NVIDIA
Downloadable
Free Endpoint
cosmos3-nano-reasoner
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
video understanding
+8
2K
1mo
NVIDIA
Downloadable
Free Endpoint
nemotron-3-nano-omni-30b-a3b-reasoning
Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.
Image-to-Text
+4
8M
2mo
NVIDIA
Downloadable
Relighting
Re-illuminate people in video to match target lighting from a 360 HDRI environment map.
HDRI
+3
227
2mo
NVIDIA
Free Endpoint
nemotron-3-content-safety
Multilingual, multimodal model for detecting unsafe and toxic content.
llm safety
+3
230K
2mo
NVIDIA
Downloadable
Free Endpoint
synthetic-video-detector
NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI‑generated (synthetic) videos.
broadcast
+4
90K
2mo
NVIDIA
Downloadable
Free Endpoint
Active Speaker Detection
Detect and track speaker identities across video frames.
broadcast
+7
473
2mo
NVIDIA
Downloadable
LipSync
Generative lip dubbing that syncs lips in a video to input audio.
broadcast
+9
2mo
NVIDIA
Downloadable
Free Endpoint
ising-calibration-1-35b-a3b
Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
Quantum
+3
332K
2mo
NVIDIA
Downloadable
llama-nemotron-rerank-vl-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
84K
3mo
NVIDIA
Free Endpoint
nemotron-voicechat
Nemotron 3 Voicechat
English
+2
2K
3mo
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Automatic Speech Recognition
+2
9K
3mo
NVIDIA
Downloadable
nemotron-ocr-v1
Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Table Extraction
+4
341K
3mo
NVIDIA
Downloadable
Free Endpoint
nemotron-3-super-120b-a12b
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
MoE
+4
60M
3mo
NVIDIA
Downloadable
llama-nemotron-rerank-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
501K
4mo
NVIDIA
Downloadable
nemotron-table-structure-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
157K
4mo
NVIDIA
Downloadable
nemotron-page-elements-v3
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+4
433K
4mo
NVIDIA
Downloadable
nemotron-graphic-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
40K
4mo
NVIDIA
Downloadable
llama-nemotron-embed-1b-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
4M
4mo
NVIDIA
Free Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
PII Detection
+1
243K
4mo