Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters

Free Endpoint

34

Partner Endpoint

11

Download Available

57

Use Case

Speech-to-Text

9

Retrieval Augmented Generation

8

Synthetic Data Generation

5

Text-to-Embedding

4

Object Detection

4

Inference Providers

Deep Infra

9

Together AI

3

Bitdeer AI

2

Lightning AI

2

Digital Ocean

2

Publisher

NVIDIA

74

OpenAI

1

Meta

0

Mistral AI

0

Qwen

0

NIM Container GPUs

H100 80GB HBM3

4

B200

3

H200

3

L40S

3

A100 SXM4 80GB

3

75 models

Sort By

DownloadableFree Endpoint

nemotron-3-ultra-550b-a55b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Items per page

of 4 pages

7.73M

5d

Free Endpoint

nemotron-3.5-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.

337K

7d

Free Endpoint

cosmos3-nano

Generates physics-aware videos from text prompts or an image prompt for physical AI development.

autonomous vehicles

1.79K

9d

DownloadableFree Endpoint

cosmos3-nano-reasoner

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.

video understanding

1.94K

9d

DownloadableFree Endpoint

nemotron-3-nano-omni-30b-a3b-reasoning

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.

7.54M

1mo

Downloadable

Relighting

Re-illuminate people in video to match target lighting from a 360 HDRI environment map.

227

1mo

Free Endpoint

nemotron-3-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.

230K

1mo

DownloadableFree Endpoint

synthetic-video-detector

NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI‑generated (synthetic) videos.

90.31K

1mo

DownloadableFree Endpoint

Active Speaker Detection

Detect and track speaker identities across video frames.

473

1mo

Downloadable

LipSync

Generative lip dubbing that syncs lips in a video to input audio.

1mo

DownloadableFree Endpoint

ising-calibration-1-35b-a3b

Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.

332K

1mo

Downloadable

llama-nemotron-rerank-vl-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

84.41K

2mo

Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat

1.77K

2mo

Downloadable

nemotron-asr-streaming

Real-time speech recognition for English

Automatic Speech Recognition

8.68K

2mo

Downloadable

nemotron-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

341K

2mo

DownloadableFree Endpoint

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

60.41M

3mo

Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

501K

3mo

Downloadable

nemotron-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

157K

3mo

Downloadable

nemotron-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

433K

3mo

Downloadable

nemotron-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

39.78K

3mo

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

4.45M

3mo

Free Endpoint

gliner-pii

GLiNER PII detects Personally Identifiable Information in text.

243K

3mo

Free Endpoint

cosmos-transfer2.5-2b

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

Synthetic Data Generation

3mo

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

7.59M

4mo