Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
55 models
Sort By
dateCreated:DESC
Most Recent
Mistral AI
mistral-large-3-675b-instruct-2512
A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
language generation
+4
4.43M
3mo
Mistral AI
ministral-14b-instruct-2512
A general purpose VLM ideal for chat and instruction based use cases
language generation
+4
3.26M
3mo
NVIDIA
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
language generation
+4
1.47M
4mo
Google
gemma-3n-e4b-it
An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
language generation
+3
598K
7mo
Google
gemma-3n-e2b-it
An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
language generation
+3
521K
7mo
Mistral AI
mistral-nemotron
Built for agentic workflows, this model excels in coding, instruction following, and function calling
language generation
+3
578K
8mo
Utter-project
eurollm-9b-instruct
State-of-the-art, multilingual model tailored to all 24 official European Union languages.
Sovereign AI
+5
5.85K
376K
8mo
Gotocompany
gemma-2-9b-cpt-sahabatai-instruct
SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
Sovereign AI
+4
376K
8mo
Mistral AI
mistral-small-3.1-24b-instruct-2503
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
language generation
+3
1.05M
9mo
Mistral AI
mistral-medium-3-instruct
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
language generation
+4
3.35M
7mo
Meta
llama-4-maverick-17b-128e-instruct
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
language generation
+4
2.2M
7mo
Meta
llama-4-scout-17b-16e-instruct
A multimodal, multilingual 16 MoE model with 17B parameters.
language generation
+4
295K
7mo
Google
gemma-3-27b-it
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Vision Assistant
+4
5.07M
9mo
Google
gemma-3-1b-it
A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
Translation
+3
5.25K
412K
9mo
Microsoft
phi-4-mini-instruct
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
chat
+3
1.58M
9mo
Microsoft
phi-4-multimodal-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
Speech Recognition
+5
372K
9mo
Tiiuae
falcon3-7b-instruct
Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
Coding
+6
400K
9mo
Qwen
qwen2.5-7b-instruct
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
Chinese Language Generation
+3
1.41M
9mo
NVIDIA
nemotron-4-mini-hindi-4b-instruct
A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
Indic
+3
381K
9mo
Institute of Science Tokyo
llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI
+3
372K
9mo
Institute of Science Tokyo
llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI
+3
383K
9mo
Meta
llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
10.98K
558K
9mo
Meta
llama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
chat
+3
15.37K
331K
9mo
Qwen
qwen2-7b-instruct
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
Chinese Language Generation
+3
512K
9mo
Items per page
24
1
1
2
2
3
3
of 3 pages