Try NVIDIA NIM APIs

Explore

Models

Skills

Blueprints

14 results for

Filters (1)

Free Endpoint

Partner Endpoint

Download Available

Developer Example

Use Case

Image-to-Text

Inference Providers

OpenRouter

Deepinfra

Together AI

GMI Cloud

Bitdeer

Publisher

NVIDIA

Mistral AI

Minimaxai

DeepSeek AI

Stepfun ai

Audience

AI Engineer

Developer

Application Developer

DevOps Engineer

Ml Engineer

Domain

Developer Tools

NIM Container GPUs

B200

H200

H100 80GB HBM3

GB200

L40S

Library

NeMo RL

NeMoClaw

Labels (1)

reasoning

Sort By

Mistral AI

DownloadableFree Endpoint

mistral-medium-3.5-128b

A high performing model for text generation, coding and agentic use cases

Model

coding

2mo

Items per page

of 1 pages

Stepfun-ai

Free Endpoint

step-3.5-flash

200B open-source reasoning engine with sparse MoE powering frontier agentic AI.

Model

Agentic

12M

4mo

DeepSeek AI

DownloadableFree Endpoint

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.

Model

Moe

2mo

Z.ai

DownloadableFree Endpoint

glm-5.1

GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.

Model

Agentic AI

32M

2mo

Minimaxai

Free Endpoint

minimax-m3

MiniMax M3 Preview is a multimodal MoE vision-language model with strong reasoning, coding, and tool-calling capabilities.

Model

coding

17d

Sarvamai

DownloadableFree Endpoint

sarvam-m

Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.

Model

coding

236K

11mo

Google

DownloadableFree Endpoint

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.

Model

reasoning

2mo

Minimaxai

DownloadableFree Endpoint

minimax-m2.7

MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.

Model

reasoning

14M

2mo

Moonshotai

DownloadableFree Endpoint

kimi-k2.6

1T multimodal MoE for long-horizon coding, agentic tool use, and image/video understanding.

Model

Multimodal

15M

1mo

Mistral AI

DownloadableFree Endpoint

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context

Model

code generation

13M

3mo

Qwen

DownloadableFree Endpoint

qwen3.5-122b-a10b

122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.

Model

tool calling

10M

3mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

Model

MoE

13M

6mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

MoE

60M

3mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-ultra-550b-a55b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

Agent

25d