Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
13 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
Downloadable
nemotron-3-super-120b-a12b
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
chat
+5
8.75M
1w
NVIDIA
Downloadable
nemotron-3-nano-30b-a3b
Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
chat
+4
13M
3mo
NVIDIA
Downloadable
llama-3.3-nemotron-super-49b-v1.5
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
chat
+4
5.37M
7mo
Mistral AI
Free Endpoint
mistral-nemotron
Built for agentic workflows, this model excels in coding, instruction following, and function calling
chat
+3
867K
9mo
IBM
Free Endpoint
granite-3.3-8b-instruct
Small language model fine-tuned for improved reasoning, coding, and instruction-following
coding
+3
137K
8mo
Gotocompany
Downloadable
gemma-2-9b-cpt-sahabatai-instruct
SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
chat
+5
536K
9mo
NVIDIA
Downloadable
llama-3.1-nemotron-ultra-253b-v1
Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.
chat
+4
8.28M
8mo
NVIDIA
Downloadable
llama-3.3-nemotron-super-49b-v1
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
chat
+4
1.11M
8mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-8b-v1
Leading reasoning and agentic AI accuracy model for PC and edge.
chat
+4
631K
8mo
Mistral AI
Downloadable
mistral-small-24b-instruct
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
chat
+4
606K
8mo
Meta
Downloadable
llama-3.3-70b-instruct
Advanced LLM for reasoning, math, general knowledge, and function calling
Instruction following
+5
20.39M
9mo
Baichuan AI
Free Endpoint
baichuan2-13b-chat
Support Chinese and English chat, coding, math, instruction following, solving quizzes
Chinese Language Generation
+4
590K
10mo
Upstage
Free Endpoint
solar-10.7b-instruct
Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
Non-Commercial Use Only
+4
555K
11mo
Items per page
24
1
1
of 1 pages