Try NVIDIA NIM APIs

Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

22 results for

Filters

Free Endpoint

1

Partner Endpoint

2

Download Available

14

Launchable

6

Developer Example

3

Enterprise Blueprint

2

Use Case

Retrieval Augmented Generation

6

Object Detection

3

Optical Character Recognition

3

Text-to-Embedding

3

Inference Providers

Deepinfra

2

Publisher

NVIDIA

20

Baidu

1

Cyborg

1

Audience

AI Engineer

1

Data Engineer

1

Developer

1

Blueprint Type

NVIDIA AI

5

Domain

AI And Machine Learning

1

NIM Container GPUs

A100 PG509 200

1

A100 SXM4 80GB

1

A10G

1

B200

1

H100 80GB HBM3

1

Library

NeMo Retriever

1

Sort By

Use when the user wants to search, query, extract, transcribe, describe, quote, filter, or aggregate across documents — PDFs, scanned forms / images (`.jpg` `.png` `.tiff`), Office (`.docx` `.pptx`), text (`.html` `.txt`), audio (`.mp3` `.wav` `.m4a`), or

1K

1mo

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

Items per page

of 1 pages

4M

4mo

Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

501K

4mo

Downloadable

nemotron-ocr-v2

Nemotron OCR v2 is a state-of-the-art multilingual text recognition model designed for robust end-to-end optical character recognition (OCR) on complex real-world images.

Table Extraction

338K

16d

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

8M

5mo

Downloadable

llama-nemotron-rerank-vl-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

84K

3mo

Downloadable

nemoretriever-ocr

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

9K

11mo

Downloadable

nemotron-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

341K

4mo

Downloadable

nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

243K

1y

Downloadable

nemoretriever-parse

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

optical character recognition

86K

1y

Downloadable

nemotron-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

40K

4mo

Downloadable

nemotron-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

157K

4mo

Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

1M

1y

Deprecation in 5dLaunchable

Cyborg Enterprise RAG

Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.

4mo

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

16M

11mo

General

LaunchableEnterprise

Build an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

4mo

Retail

LaunchableDeveloper Example

Multi-Agent Intelligent Warehouse

An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.

4mo

Downloadable

nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

191

1y

Retail

LaunchableDeveloper Example

Retail Shopping Assistant

Elevate Shopping Experiences Online and In Stores.

4mo

Media

LaunchableDeveloper Example

Streaming Data to RAG

Sensor-captured radio enables real-time awareness, AI-driven analytics for actionable, searchable insights.

4mo

Downloadable

paddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Optical Character Recognition

201K

1y

General

LaunchableEnterprise

NVIDIA AI-Q Blueprint for intelligent agents

AI agents that connect, retrieve, and reason on enterprise data—making information accessible, actionable, and intelligent.

4mo