Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
96 models
NVIDIA
Enterprise
Build A Generative Protein Binder Design Pipeline
This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
generative-virtual-screening-for-drug-discovery
+6
2.26K
5d
NVIDIA
Free Endpoint
nemotron-voicechat
Nemotron 3 Voicechat
English
+2
3.72K
1w
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Automatic Speech Recognition
+2
1.43K
2w
NVIDIA
Downloadable
nemotron-ocr-v1
Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Table Extraction
+4
46.36K
2w
NVIDIA
Downloadable
nemotron-3-super-120b-a12b
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
chat
+5
20.83M
2w
NVIDIA
Downloadable
llama-nemotron-rerank-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
12.18K
3w
NVIDIA
Downloadable
nemotron-table-structure-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
12.28K
3w
NVIDIA
Downloadable
nemotron-page-elements-v3
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+4
29.65K
3w
NVIDIA
Downloadable
nemotron-graphic-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
10.45K
3w
NVIDIA
Downloadable
llama-nemotron-embed-1b-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
806K
3w
NVIDIA
Free Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
PII Detection
+1
167K
3w
NVIDIA
Free Endpoint
cosmos-transfer2.5-2b
Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Synthetic Data Generation
+4
1mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
nemo retriever
+3
3.64M
1mo
NVIDIA
Free Endpoint
nemotron-content-safety-reasoning-4b
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
NeMo Guardrails
+3
558K
2mo
NVIDIA
Downloadable
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
video understanding
+8
164K
3mo
NVIDIA
Downloadable
nemoretriever-page-elements-v3
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+5
466K
3mo
NVIDIA
Downloadable
nemotron-3-nano-30b-a3b
Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
chat
+4
13.61M
3mo
NVIDIA
Free Endpoint
riva-translate-4b-instruct-v1_1
Translation model supporting 12 languages, with few-shot example prompting capability.
nvidia nim
+2
531K
3mo
NVIDIA
Free Endpoint
streampetr
StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.
autonomous vehicles
+3
258K
4mo
NVIDIA
Downloadable
nemotron-parse
Cutting-edge vision-language model excelling in retrieving text and metadata from images.
text and table extraction
+2
520K
5mo
NVIDIA
Downloadable
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
chat
+4
1.22M
5mo
NVIDIA
Free Endpoint
llama-3.1-nemotron-safety-guard-8b-v3
Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
content moderation
+4
582K
5mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Taiwanese Mandarin and English transcription.
ASR
+4
363
5mo
NVIDIA
Downloadable
llama-3_2-nemoretriever-300m-embed-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
6.01K
5mo