Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
10 models
Meta
Free Endpoint
llama-4-maverick-17b-128e-instruct
A general-purpose multimodal, multilingual mixture-of-experts (MoE) model with 128 experts and 17B active parameters.
chat
5.93M
8mo
Meta
Downloadable
Free Endpoint
llama-4-scout-17b-16e-instruct
A multimodal, multilingual mixture-of-experts (MoE) model with 16 experts and 17B active parameters.
language generation
24.25K
8mo
Institute of Science Tokyo
Downloadable
llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese-language data that understands regional nuances.
chat
486K
10mo
Institute of Science Tokyo
Downloadable
llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese-language data that understands regional nuances.
chat
487K
10mo
Meta
Downloadable
llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
31.02K
960K
10mo
Meta
Downloadable
llama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
16.31K
309K
10mo
Yen-Ting Lin
Downloadable
llama-3-taiwan-70b-instruct
Sovereign AI model fine-tuned on Traditional Mandarin and English data using the Llama-3 architecture.
regional language generation
498K
10mo
TokyoTech-LLM
Downloadable
llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese-language data that understands regional nuances.
chat
490K
10mo
Meta
Downloadable
llama-3.1-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning, and text generation.
chat
8.88M
9mo
Meta
Downloadable
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
chat
9.1M
8mo