Models
Deploy and scale models on the GPU infrastructure of your choice with NVIDIA NIM inference microservices.
12 models
Meta
llama-4-maverick-17b-128e-instruct
A general-purpose multimodal, multilingual mixture-of-experts (MoE) model with 128 experts and 17B active parameters.
language generation
+4
3.01M
7mo
Meta
llama-4-scout-17b-16e-instruct
A multimodal, multilingual mixture-of-experts (MoE) model with 16 experts and 17B active parameters.
language generation
+4
210K
7mo
Institute of Science Tokyo
llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese-language data that understands regional nuances.
Sovereign AI
+3
461K
9mo
Institute of Science Tokyo
llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese-language data that understands regional nuances.
Sovereign AI
+3
472K
9mo
Meta
llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
12.44K
634K
9mo
Meta
llama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
14.99K
408K
9mo
Yen-Ting Lin
llama-3-taiwan-70b-instruct
Sovereign AI model fine-tuned on Traditional Mandarin and English data using the Llama-3 architecture.
regional language generation
+3
469K
9mo
TokyoTech-LLM
llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese-language data that understands regional nuances.
Large Language Model
+2
473K
9mo
Meta
llama-3.1-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+3
6.85M
8mo
Meta
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
chat
+4
4.84M
8mo
Meta
llama3-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+4
833K
9mo
Meta
llama3-8b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
chat
+4
1.11M
9mo