Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
25 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
llama-3.3-nemotron-super-49b-v1.5
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
chat
+4
4.61M
7mo
NVIDIA
llama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text/img and creates informative responses
doc intelligence
+3
7.51M
8mo
NVIDIA
llama-3.1-nemotron-nano-4b-v1.1
State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents
edge
+4
98.4K
8mo
NVIDIA
llama-3.1-nemotron-ultra-253b-v1
Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.
chat
+4
7.36M
8mo
Meta
llama-4-maverick-17b-128e-instruct
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
language generation
+4
3.01M
7mo
Meta
llama-4-scout-17b-16e-instruct
A multimodal, multilingual 16 MoE model with 17B parameters.
language generation
+4
210K
7mo
NVIDIA
llama-3.3-nemotron-super-49b-v1
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
chat
+4
1.09M
7mo
NVIDIA
llama-3.1-nemotron-nano-8b-v1
Leading reasoning and agentic AI accuracy model for PC and edge.
chat
+4
592K
8mo
DeepSeek AI
deepseek-r1-distill-llama-8b
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distillation
+5
4.6M
8mo
Meta
llama-3.3-70b-instruct
Advanced LLM for reasoning, math, general knowledge, and function calling
Reasoning
+5
23.06M
8mo
Institute of Science Tokyo
llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI
+3
461K
9mo
Institute of Science Tokyo
llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI
+3
472K
9mo
Meta
llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
12.44K
634K
9mo
Meta
llama-3.2-11b-vision-instruct
Cutting-edge vision-language model exceling in high-quality reasoning from images.
Image-Text Retrieval
+5
676K
9mo
Meta
llama-3.2-90b-vision-instruct
Cutting-edge vision-Language model exceling in high-quality reasoning from images.
Image-Text Retrieval
+5
579K
9mo
Meta
llama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
14.99K
408K
9mo
Abacus.AI
dracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
chat
+2
572K
9mo
Yen-Ting Lin
llama-3-taiwan-70b-instruct
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
regional language generation
+3
469K
9mo
TokyoTech-LLM
llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
Large Language Model
+2
473K
9mo
Meta
llama-3.1-405b-instruct
Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.
synthetic data generation
+2
2.6M
11mo
Meta
llama-3.1-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+3
6.85M
8mo
Meta
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
chat
+4
4.84M
8mo
NVIDIA
llama3-chatqa-1.5-8b
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
text-to-text
+2
498K
9mo
Meta
llama3-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+4
833K
9mo
Items per page
24
1
1
2
2
of 2 pages