Try NVIDIA NIM APIs

Explore

Models

Skills

Blueprints

13 results for

Filters (1)

Free Endpoint

Partner Endpoint

Download Available

Use Case

Image-to-Text

Retrieval Augmented Generation

Object Detection

Text-to-Embedding

Optical Character Recognition

Inference Providers

Deepinfra

Bitdeer

Digital Ocean

Lightning AI

CoreWeave

Publisher

NVIDIA

Mistral AI

NIM Container GPUs

A100 SXM4 80GB

A10G

H100 80GB HBM3

H100 NVL

H200

Labels (1)

Chat

Sort By

NVIDIA

Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat

Model

English

1.77K

3mo

Items per page

of 1 pages

Mistral AI

Free Endpoint

mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

Model

language generation

1.49M

NVIDIA

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

Model

Chat

1.53M

NVIDIA

DownloadableFree Endpoint

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

Model

language generation

2.47M

7mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

Model

MoE

11.91M

6mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

MoE

60.41M

3mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-ultra-550b-a55b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

Agent

7.73M

15d

NVIDIA

DownloadableFree Endpoint

nvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

Model

thinking budget

988K

10mo

NVIDIA

DownloadableFree Endpoint

nemotron-3-nano-omni-30b-a3b-reasoning

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.

Model

Image-to-Text

7.54M

1mo

NVIDIA

DownloadableFree Endpoint

llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

Model

advanced reasoning

1.47M

11mo

NVIDIA

DownloadableFree Endpoint

llama-3.1-nemotron-nano-vl-8b-v1

Multi-modal vision-language model that understands text/img and creates informative responses

Model

doc intelligence

10.15M

11mo

NVIDIA

DownloadableFree Endpoint

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

advanced reasoning

4.93M

11mo

NVIDIA

DownloadableFree Endpoint

llama-3.3-nemotron-super-49b-v1.5

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

advanced reasoning

3.17M

10mo