Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (1)
12 models
Sort By
dateCreated:DESC
Most Recent
Meta
Free Endpoint
llama-4-maverick-17b-128e-instruct
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
chat
+4
3.66M
8mo
Meta
Downloadable
Free Endpoint
llama-4-scout-17b-16e-instruct
A multimodal, multilingual 16 MoE model with 17B parameters.
language generation
+4
64.89K
8mo
Institute of Science Tokyo
Downloadable
llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
chat
+4
530K
10mo
Institute of Science Tokyo
Downloadable
llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
chat
+4
537K
10mo
Meta
Downloadable
llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+4
24.64K
776K
10mo
Meta
Downloadable
llama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+4
15.86K
327K
10mo
Yen-Ting Lin
Downloadable
llama-3-taiwan-70b-instruct
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
regional language generation
+4
543K
10mo
TokyoTech-LLM
Downloadable
llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
chat
+3
539K
10mo
Meta
Downloadable
llama-3.1-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+4
8.02M
9mo
Meta
Downloadable
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
chat
+5
5.6M
8mo
Meta
Downloadable
llama3-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
chat
+5
877K
10mo
Meta
Downloadable
llama3-8b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
chat
+5
1.13M
10mo
Items per page
24
1
1
of 1 pages