Explore

Models

Skills

Blueprints

GPUs

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

Free Endpoint5

Partner Endpoint5

Download Available5

Use Case

Drug Discovery0Retrieval Augmented Generation0Speech-to-Text0Image Generation0Image-to-Text0

Inference Providers

OpenRouter5Deepinfra3GMI Cloud3Bitdeer3Together AI2

Publisher

NVIDIA3DeepSeek AI2Meta0Google0Mistral AI0

NIM Container GPUs

A100 SXM4 80GB0H100 80GB HBM30L40S0A10G0B2000

Labels (1)

MoE

5 models

Sort By

NVIDIA

DownloadableFree Endpoint

nemotron-3-ultra-550b-a55b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Agent

MoE
Frontier
Reasoning
Long Context

Items per page

of 1 pages

52M API calls in the last 30 days

Last updated on June 4, 2026

DeepSeek AI

DownloadableFree Endpoint

deepseek-v4-flash

DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.

coding

MoE
fast
agentic

17M API calls in the last 30 days

Last updated on April 24, 2026

DeepSeek AI

DownloadableFree Endpoint

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.

reasoning
coding
agentic

7M API calls in the last 30 days

Last updated on April 24, 2026

NVIDIA

DownloadableFree Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Reasoning
Chat
Long Context
Instruction Following

65M API calls in the last 30 days

Last updated on March 11, 2026

NVIDIA

DownloadableFree Endpoint

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

Reasoning
Long Context
Instruction Following

12M API calls in the last 30 days

Last updated on December 15, 2025