Try NVIDIA NIM APIs

Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

45 results for

Filters

Free Endpoint

16

Partner Endpoint

9

Download Available

25

Launchable

6

Developer Example

4

Enterprise Blueprint

2

NemoClaw Blueprint

1

Use Case

Retrieval Augmented Generation

5

Object Detection

3

Text-to-Embedding

3

Image-to-Text

2

Optical Character Recognition

2

Inference Providers

Deepinfra

7

Bitdeer

4

OpenRouter

3

Digital Ocean

2

Lightning AI

2

Publisher

NVIDIA

44

Mistral AI

1

Audience

AI Engineer

5

Developer

5

Ml Engineer

3

Application Developer

2

Data Scientist

2

Blueprint Type

NVIDIA AI

7

Domain

AI And Machine Learning

5

NIM Container GPUs

A100 SXM4 80GB

3

A10G

3

H100 80GB HBM3

3

H100 NVL

3

H200

3

Library

Nemotron

3

Riva

2

Sort By

Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat

2K

4mo

Routes NVIDIA Nemotron Speech (Riva) NIM tasks — deploys, runs, and tests ASR, TTS, and NMT NIMs on build.nvidia.com or self-hosted.

2K

1mo

Items per page

of 2 pages

Plan, configure, and chain repo-native Nemotron customization steps into single-step or multi-step pipelines: curation, translation, SFT/PEFT (AutoModel or Megatron-Bridge), pretraining/CPT, RL alignment (DPO/RLVR/GRPO/RLHF), BYOB/MCQ benchmarks, checkpoi

2K

1mo

Free Endpoint

mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

language generation

1M

1y

Downloadable

nemotron-parse

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

text and table extraction

1M

8mo

General

LaunchableDeveloper Example

Nemotron Voice Agent

Build Real-Time, Multimodal Voice Agents with NVIDIA Nemotron NIM.

4mo

Free Endpoint

nemotron-3-embed-1b

1B embedding model for semantic search, retrieval, and RAG applications.

Nemotron Retriever

4d

Generates BYO custom safety policies for NVIDIA Nemotron content-safety guardrails — Nemotron-Content-Safety-Reasoning-4B (text) and multimodal Nemotron-3-Content-Safety. Produces a Markdown policy, JSON taxonomy, and drop-in inference prompts. Maps rough

2K

1mo

Use when planning, debugging, tuning, evaluating, exporting, or deploying public Nemotron `embed`/`rerank` retrieval recipes.

2K

1mo

Downloadable

nemotron-ocr-v2

Nemotron OCR v2 is a state-of-the-art multilingual text recognition model designed for robust end-to-end optical character recognition (OCR) on complex real-world images.

Table Extraction

338K

26d

Orchestration skill for NVIDIA Nemotron Speech (Riva) / NeMo ASR domain and language adaptation. Given a goal like "improve/fine-tune ASR for my domain or language", it scopes the task, picks the cheapest sufficient path (word boosting → n-gram LM → fine-

Today

DownloadableFree Endpoint

nemotron-3.5-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.

2M

1mo

Downloadable

nemotron-asr-streaming

Real-time speech recognition for English

Automatic Speech Recognition

6K

4mo

Downloadable

nemotron-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

40K

4mo

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

2M

1y

Downloadable

nemotron-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

341K

4mo

Downloadable

nemotron-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

433K

4mo

Downloadable

nemotron-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

157K

4mo

DownloadableFree Endpoint

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

language generation

5M

8mo

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

4M

4mo

Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

501K

4mo

DownloadableFree Endpoint

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

12M

7mo

DownloadableFree Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

60M

4mo

DownloadableFree Endpoint

nemotron-3-ultra-550b-a55b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

52M

1mo