Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
58 models
Mistral AI
mistral-large-3-675b-instruct-2512
A state-of-the-art general-purpose MoE VLM ideal for chat, agentic, and instruction-based use cases.
language generation
+4
3mo
Mistral AI
ministral-14b-instruct-2512
A general-purpose VLM ideal for chat and instruction-based use cases.
language generation
+4
3mo
NVIDIA
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
language generation
+4
4mo
Speakleash
bielik-11b-v2.6-instruct
State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
Polish
+4
5mo
Google
gemma-3n-e4b-it
An edge-computing AI model that accepts text, audio, and image input, ideal for resource-constrained environments.
language generation
+3
7mo
Google
gemma-3n-e2b-it
An edge-computing AI model that accepts text, audio, and image input, ideal for resource-constrained environments.
language generation
+3
7mo
Mistral AI
mistral-nemotron
Built for agentic workflows, this model excels in coding, instruction following, and function calling.
language generation
+3
8mo
Utter-project
eurollm-9b-instruct
State-of-the-art, multilingual model tailored to all 24 official European Union languages.
Sovereign AI
+5
8mo
Gotocompany
gemma-2-9b-cpt-sahabatai-instruct
SOTA LLM pre-trained for instruction following and proficiency in the Indonesian language and its dialects.
Sovereign AI
+4
8mo
Mistral AI
mistral-small-3.1-24b-instruct-2503
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast responses.
language generation
+3
9mo
Mistral AI
mistral-medium-3-instruct
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
language generation
+4
7mo
Meta
llama-4-maverick-17b-128e-instruct
A general-purpose multimodal, multilingual 128-expert MoE model with 17B parameters.
language generation
+4
7mo
Meta
llama-4-scout-17b-16e-instruct
A multimodal, multilingual 16-expert MoE model with 17B parameters.
language generation
+4
7mo
Google
gemma-3-27b-it
Cutting-edge open multimodal model excelling in high-quality reasoning from images.
Vision Assistant
+4
9mo
Google
gemma-3-1b-it
A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications.
Translation
+3
9mo
Microsoft
phi-4-mini-instruct
Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments.
chat
+3
9mo
Microsoft
phi-4-multimodal-instruct
Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
Speech Recognition
+5
9mo
Tiiuae
falcon3-7b-instruct
Instruction-tuned LLM achieving SoTA performance on reasoning, math, and general knowledge tasks.
Coding
+6
9mo
Qwen
qwen2.5-7b-instruct
Chinese and English LLM targeting language, coding, mathematics, and reasoning tasks.
Chinese Language Generation
+3
9mo
Qwen
qwen2.5-coder-32b-instruct
Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
code completion
+3
8mo
Qwen
qwen2.5-coder-7b-instruct
Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
code completion
+3
9mo
NVIDIA
nemotron-4-mini-hindi-4b-instruct
A bilingual Hindi-English SLM for on-device inference, tailored specifically for the Hindi language.
Indic
+3
9mo
Institute of Science Tokyo
llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on the Japanese language that understands regional nuances.
Sovereign AI
+3
9mo
Institute of Science Tokyo
llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on the Japanese language that understands regional nuances.
Sovereign AI
+3
9mo