Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
7 models
Sorted by: Most Recent
Meta
llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
12.44K
634K
9mo
Meta
llama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
14.99K
408K
9mo
Meta
llama-3.1-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+3
6.85M
8mo
Meta
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
chat
+4
4.84M
8mo
NVIDIA
llama3-chatqa-1.5-8b
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
text-to-text
+2
498K
9mo
Meta
llama3-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+4
833K
9mo
Meta
llama3-8b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
chat
+4
1.11M
9mo