Try NVIDIA NIM APIs

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

38.73M

2mo

Items per page

of 1 pages

Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

383K

2mo

Downloadable

nemoretriever-ocr

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

14.25K

10mo

Downloadable

nemoretriever-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

2.07M

9mo

Downloadable

nemotron-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

336K

2mo

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

7.15M

3mo

Downloadable

llama-nemotron-rerank-vl-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

120K

1mo

Downloadable

nemoretriever-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

8.09K

Downloadable

nemoretriever-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

3.19K

5mo

Downloadable

nemoretriever-parse

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

optical character recognition

125K

11mo

Downloadable

nemoretriever-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

71.26K

Downloadable

nemotron-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

39.24K

2mo

Downloadable

nemotron-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

122K

2mo

Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

224K

12mo

Cyborg

Deprecation in 25dLaunchable

Cyborg Enterprise RAG

Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.

NIM

3mo

General

LaunchableEnterprise

Build an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

3mo

Retail

LaunchableDeveloper Example

Multi-Agent Intelligent Warehouse

An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.

3mo

Retail

LaunchableDeveloper Example

Retail Shopping Assistant

Elevate Shopping Experiences Online and In Stores.

3mo

Media

LaunchableDeveloper Example

Streaming Data to RAG

Sensor-captured radio enables real-time awareness, AI-driven analytics for actionable, searchable insights.

3mo

General

LaunchableEnterprise

NVIDIA AI-Q Blueprint for intelligent agents

AI agents that connect, retrieve, and reason on enterprise data—making information accessible, actionable, and intelligent.