Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (1)
10 models
Sort By
dateCreated:DESC
Most Recent
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Tool Calling
+3
687K
3d
NVIDIA
Free Endpoint
nemotron-3-content-safety
Multilingual, multimodal model for detecting unsafe and toxic content.
llm safety
+3
2.91K
3d
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Tool Calling
+3
16.34M
2mo
NVIDIA
Free Endpoint
llama-3.1-nemotron-safety-guard-8b-v3
Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
content moderation
+4
121K
5mo
Opengpt-x
Deprecated
Downloadable
teuken-7b-instruct-commercial-v0.4
Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.
sovereign ai
+4
58.76K
8mo
Sarvamai
Downloadable
sarvam-m
Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.
coding
+5
165K
8mo
Mistral AI
Free Endpoint
magistral-small-2506
High performance reasoning model optimized for efficiency and edge deployment
coding
+3
1.26M
9mo
NVIDIA
Downloadable
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
TTS
+4
36.23K
9mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
ASR
+8
72.54K
1y
Mistral AI
Deprecated
Downloadable
mistral-small-24b-instruct
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
code
+3
189K
9mo
Items per page
24
1
1
of 1 pages