Try NVIDIA NIM APIs

Explore

Models

Skills

Blueprints

17 results for

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Image-to-Text

Inference Providers

OpenRouter

Deepinfra

GMI Cloud

Together AI

Bitdeer

Publisher

Mistral AI

NVIDIA

DeepSeek AI

Minimaxai

Stepfun ai

NIM Container GPUs

B200

H200

H100 80GB HBM3

GB200

L40S

Sort By

Mistral AI

DownloadableFree Endpoint

mistral-medium-3.5-128b

A high performing model for text generation, coding and agentic use cases

Model

coding

2mo

Items per page

of 1 pages

Stepfun-ai

Free Endpoint

step-3.5-flash

200B open-source reasoning engine with sparse MoE powering frontier agentic AI.

Model

Agentic

12M

4mo

DeepSeek AI

DownloadableFree Endpoint

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.

Model

Moe

2mo

Z.ai

DownloadableFree Endpoint

glm-5.1

GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.

Model

Agentic AI

32M

2mo

Stepfun-ai

DownloadableFree Endpoint

step-3.7-flash

A sparse MoE multimodal reasoning model good for enterprise, agentic and coding tasks.

Model

Coding

1mo

Minimaxai

Free Endpoint

minimax-m3

MiniMax M3 Preview is a multimodal MoE vision-language model with strong reasoning, coding, and tool-calling capabilities.

Model

coding

17d

Sarvamai

DownloadableFree Endpoint

sarvam-m

Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.

Model

coding

236K

11mo

Google

DownloadableFree Endpoint

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.

Model

reasoning

2mo

DeepSeek AI

DownloadableFree Endpoint

deepseek-v4-flash

DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.

Model

MoE

15M

2mo

Minimaxai

DownloadableFree Endpoint

minimax-m2.7

MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.

Model

reasoning

14M

2mo

Moonshotai

DownloadableFree Endpoint

kimi-k2.6

1T multimodal MoE for long-horizon coding, agentic tool use, and image/video understanding.

Model

Multimodal

15M

1mo

Mistral AI

Free Endpoint

mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

Model

language generation

Mistral AI

DownloadableFree Endpoint

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context

Model

code generation

13M

3mo

Qwen

DownloadableFree Endpoint

qwen3.5-122b-a10b

122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.

Model

tool calling

10M

3mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

Model

MoE

13M

6mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

MoE

60M

3mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-ultra-550b-a55b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

Agent

25d