Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
11 models
Sort By
dateCreated:DESC
Most Recent
Z.ai
Downloadable
glm-5.1
GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
Agentic AI
+3
Items per page
24
1
1
of 1 pages
2.53M
1w
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Tool Calling
+3
4.57M
1w
Moonshotai
Free Endpoint
kimi-k2-thinking
Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.
Conversational
+3
3.37M
4mo
Mistral AI
Free Endpoint
mistral-large-3-675b-instruct-2512
A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
language generation
+3
4.15M
4mo
Mistral AI
Downloadable
ministral-14b-instruct-2512
A general purpose VLM ideal for chat and instruction based use cases
language generation
+3
1.6M
4mo
Qwen
Free Endpoint
qwen3-coder-480b-a35b-instruct
Excels in agentic coding and browser use and supports 256K context, delivering top results.
agentic coding
+3
3.27M
7mo
NVIDIA
Free Endpoint
usdcode
State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
Digital Twin
+4
9mo
NVIDIA
Free Endpoint
usdvalidate
Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.
Validation
+5
617
1y
NVIDIA
Free Endpoint
nv-embed-v1
Generates high-quality numerical embeddings from text inputs.
Non-Commercial Use Only
+2
3.67M
9mo
Upstage
Free Endpoint
solar-10.7b-instruct
Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
Non-Commercial Use Only
+4
192K
1y
NVIDIA
Downloadable
vista-3d
VISTA-3D is a specialized interactive foundation model for segmenting and anotating human anatomies.
Interactive Annotation
+3
757
1y