Try NVIDIA NIM APIs

Explore

Models

Skills

Blueprints

30 results for

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Retrieval Augmented Generation

Object Detection

Image-to-Text

Text-to-Embedding

Optical Character Recognition

Inference Providers

Deepinfra

Together AI

Bitdeer

Digital Ocean

Lightning AI

Publisher

NVIDIA

Mistral AI

NIM Container GPUs

A100 SXM4 80GB

A10G

H100 80GB HBM3

H100 NVL

H200

Sort By

NVIDIA

Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat

Model

English

1.77K

3mo

Items per page

of 2 pages

NVIDIA

Downloadable

nemotron-parse

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

Model

text and table extraction

218K

7mo

Mistral AI

Free Endpoint

mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

Model

language generation

1.49M

NVIDIA

Free Endpoint

nemotron-3-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.

Model

llm safety

230K

2mo

NVIDIA

Free Endpoint

nemotron-3.5-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.

Model

llm safety

337K

16d

NVIDIA

Downloadable

nemotron-asr-streaming

Real-time speech recognition for English

Model

Automatic Speech Recognition

8.88K

3mo

NVIDIA

Downloadable

nemotron-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model

Object Detection

39.78K

3mo

NVIDIA

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

Model

Chat

1.53M

NVIDIA

Downloadable

nemotron-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Model

Table Extraction

341K

3mo

NVIDIA

Downloadable

nemotron-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model

Object Detection

433K

3mo

NVIDIA

Downloadable

nemotron-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model

Object Detection

157K

3mo

NVIDIA

Free Endpoint

nemotron-content-safety-reasoning-4b

A context‑aware safety model that applies reasoning to enforce domain‑specific policies.

Model

NeMo Guardrails

145K

4mo

NVIDIA

DownloadableFree Endpoint

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

Model

language generation

2.47M

7mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

Model

MoE

11.91M

6mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

MoE

60.41M

3mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-ultra-550b-a55b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

Agent

7.73M

14d

NVIDIA

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Model

Text-to-Embedding

4.45M

3mo

NVIDIA

Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

Model

nemo retriever

501K

3mo

NVIDIA

DownloadableFree Endpoint

nvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

Model

thinking budget

988K

10mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-nano-omni-30b-a3b-reasoning

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.

Model

Image-to-Text

7.54M

1mo

NVIDIA

DownloadableFree Endpoint

llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

Model

advanced reasoning

1.47M

11mo

NVIDIA

DownloadableFree Endpoint

llama-3.1-nemotron-nano-vl-8b-v1

Multi-modal vision-language model that understands text/img and creates informative responses

Model

doc intelligence

10.15M

11mo

NVIDIA

Free Endpoint

llama-3.1-nemotron-safety-guard-8b-v3

Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs

Model

content moderation

336K

7mo

NVIDIA

DownloadableFree Endpoint

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

advanced reasoning

4.93M

11mo