Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

12 results for

Filters (1)

Free Endpoint

Partner Endpoint

Download Available

Launchable

Enterprise

Use Case

Retrieval Augmented Generation

Text-to-Embedding

Image-to-Text

Inference Providers

Deep Infra

Together AI

Bitdeer AI

Publisher

NVIDIA

Cyborg

Cyborg Enterprise RAG

Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.

Blueprint

NIM

2mo

Items per page

of 1 pages

NVIDIA

Launchable

Multi-Agent Intelligent Warehouse

An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.

Blueprint

NVIDIA AI

2mo

NVIDIA

Launchable

Retail Shopping Assistant

Elevate Shopping Experiences Online and In Stores.

Blueprint

NVIDIA AI

2mo

NVIDIA

LaunchableEnterprise

Build an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

Blueprint

NVIDIA AI

2mo

NVIDIA

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

Model

Embedding

9.24M

9mo

NVIDIA

Downloadable

llama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

Model

nemo retriever

103K

9mo

NVIDIA

Free Endpoint

llama-3_2-nemoretriever-300m-embed-v1

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Model

Text-to-Embedding

344K

9mo

NVIDIA

Downloadable

llama-3_2-nemoretriever-300m-embed-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Model

Text-to-Embedding

6mo

NVIDIA

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Model

Text-to-Embedding

1.95M

1mo

NVIDIA

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

Model

nemo retriever

7.17M

2mo

NVIDIA

Downloadable

llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

Model

nemo retriever

2.52M

9mo

NVIDIA

Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

Model

nemo retriever

157K

11mo