Try NVIDIA NIM APIs


38 results for


NVIDIA

nemotron-content-safety-reasoning-4b

A context‑aware safety model that applies reasoning to enforce domain‑specific policies.

Model

NeMo Guardrails

148K

4mo


NVIDIA

Downloadable

Free Endpoint

nemotron-3-nano-omni-30b-a3b-reasoning

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, and text.

Model

Image-to-Text

8.93M

1mo

NVIDIA

Downloadable

Free Endpoint

nvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

Model

thinking budget

1.01M

9mo

ByteDance

Free Endpoint

seed-oss-36b-instruct

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

Model

thinking budget

1.23M

9mo

Stepfun-ai

Free Endpoint

step-3.5-flash

200B open-source reasoning engine with sparse MoE powering frontier agentic AI.

Model

Agentic

12.01M

4mo

NVIDIA

Downloadable

Free Endpoint

ising-calibration-1-35b-a3b

Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.

Model

Quantum

339K

1mo

Mistral AI

Downloadable

Free Endpoint

mistral-medium-3.5-128b

A high-performing model for text generation, coding, and agentic use cases.

Model

coding

3.46M

1mo

Sarvamai

Downloadable

Free Endpoint

sarvam-m

Multilingual, hybrid-reasoning model optimized for Indian-language tasks, programming, and mathematical reasoning.

Model

coding

287K

10mo

Mistral AI

Downloadable

Free Endpoint

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context

Model

code generation

18.79M

2mo

Minimaxai

Downloadable

Free Endpoint

minimax-m2.7

MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.

Model

B200

14.2M

1mo

NVIDIA

Downloadable

Free Endpoint

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

Model

MoE

12M

5mo

NVIDIA

Downloadable

Free Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

MoE

61.45M

2mo

NVIDIA

Downloadable

Free Endpoint

An MoE LLM that follows instructions, completes requests, and generates creative text.

Model

B200

899K

10mo