Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (1)
8 models
Sort By
dateCreated:DESC
Most Recent
Minimaxai
Free Endpoint
minimax-m2.1
MiniMax M2.1 excels in multi-language coding, app/web dev, office AI, and agent integration
chat
+3
8.33M
1mo
Moonshotai
Downloadable
kimi-k2.5
1T multimodal MoE for high‑capacity video and image understanding with efficient inference.
Multimodal
+4
22.84M
1mo
Mistral AI
Free Endpoint
mistral-large-3-675b-instruct-2512
A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
chat
+4
6.69M
3mo
Mistral AI
Downloadable
ministral-14b-instruct-2512
A general purpose VLM ideal for chat and instruction based use cases
chat
+4
4.67M
3mo
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.
LLM Multimodal Safety
+3
495K
8mo
Mistral AI
Free Endpoint
mistral-small-3.1-24b-instruct-2503
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
chat
+3
1.8M
9mo
Mistral AI
Free Endpoint
mistral-medium-3-instruct
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
chat
+4
5.28M
8mo
NVIDIA
Downloadable
nvclip
NV-CLIP is a multimodal embeddings model for image and text.
Computer vision
+3
23.65K
9mo
Items per page
24
1
1
of 1 pages