Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (1)
9 models
Sort By
dateCreated:DESC
Most Recent
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Tool Calling
+4
14.39M
2mo
NVIDIA
Free Endpoint
llama-3.1-nemotron-safety-guard-8b-v3
Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
content moderation
+4
285K
5mo
Opengpt-x
Downloadable
teuken-7b-instruct-commercial-v0.4
Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.
sovereign ai
+5
187K
8mo
Sarvamai
Downloadable
sarvam-m
Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.
coding
+6
253K
8mo
Mistral AI
Free Endpoint
magistral-small-2506
High performance reasoning model optimized for efficiency and edge deployment
coding
+4
2.09M
8mo
Utter-project
Downloadable
eurollm-9b-instruct
State-of-the-art, multilingual model tailored to all 24 official European Union languages.
chat
+6
3.71K
181K
9mo
NVIDIA
Downloadable
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
TTS
+4
42.34K
9mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
ASR
+8
73.61K
11mo
Mistral AI
Downloadable
mistral-small-24b-instruct
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
chat
+4
292K
9mo
Items per page
24
1
1
of 1 pages