Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
25 models
Sorted by: Most Recent
NVIDIA
Downloadable
llama-nemotron-rerank-vl-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
84.41K
2mo
NVIDIA
Downloadable
llama-nemotron-rerank-1b-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
+2
501K
3mo
NVIDIA
Downloadable
llama-nemotron-embed-1b-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
4.45M
3mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
nemo retriever
+3
7.59M
4mo
NVIDIA
Downloadable
nemotron-parse
Cutting-edge vision-language model excelling in retrieving text and metadata from images.
text and table extraction
+2
218K
7mo
NVIDIA
Free Endpoint
llama-3.1-nemotron-safety-guard-8b-v3
Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
content moderation
+4
336K
7mo
NVIDIA
Downloadable
Free Endpoint
llama-3.3-nemotron-super-49b-v1.5
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
advanced reasoning
+3
3.17M
10mo
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well as output responses.
LLM Multimodal Safety
+3
222K
11mo
NVIDIA
Downloadable
Free Endpoint
llama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text and images and creates informative responses
doc intelligence
+2
10.15M
11mo
Meta
Free Endpoint
llama-4-maverick-17b-128e-instruct
A general-purpose multimodal, multilingual 128-expert MoE model with 17B active parameters.
language generation
+3
20.32M
11mo
NVIDIA
Downloadable
Free Endpoint
llama-3.3-nemotron-super-49b-v1
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
advanced reasoning
+3
4.93M
11mo
NVIDIA
Downloadable
Free Endpoint
llama-3.1-nemotron-nano-8b-v1
Model with leading reasoning and agentic AI accuracy for PC and edge.
advanced reasoning
+3
1.47M
11mo
NVIDIA
Downloadable
nemoretriever-parse
Cutting-edge vision-language model excelling in retrieving text and metadata from images.
optical character recognition
+4
85.79K
1y
NVIDIA
Downloadable
llama-3.1-nemoguard-8b-topic-control
Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
nemo guardrails
+4
149K
1y
NVIDIA
Downloadable
llama-3.1-nemoguard-8b-content-safety
Leading content safety model for enhancing the safety and moderation capabilities of LLMs
nemo guardrails
+4
160K
1y
Meta
Downloadable
Free Endpoint
llama-3.3-70b-instruct
Advanced LLM for reasoning, math, general knowledge, and function calling
Instruction following
+4
18.79M
1y
Meta
Downloadable
Free Endpoint
llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Language Generation
+3
28.5K
1.22M
1y
Meta
Downloadable
Free Endpoint
llama-3.2-11b-vision-instruct
Cutting-edge vision-language model excelling in high-quality reasoning from images.
Image-Text Retrieval
+4
1.67M
1y
Meta
Downloadable
Free Endpoint
llama-3.2-90b-vision-instruct
Cutting-edge vision-language model excelling in high-quality reasoning from images.
Image-Text Retrieval
+4
2.69M
1y
Meta
Downloadable
Free Endpoint
llama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Language Generation
+3
44.06K
290K
1y
Abacus.AI
Free Endpoint
dracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
Code Generation
+1
783K
1y
Meta
Free Endpoint
esm2-650m
Generates embeddings of proteins from their amino acid sequences.
nim
+4
128K
1y
Meta
Downloadable
Free Endpoint
llama-3.1-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
Chat
+3
3.9M
1y
Meta
Downloadable
Free Endpoint
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
Chat
+4
25.09M
11mo