Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

12 results for

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Retrieval Augmented Generation

Text-to-Embedding

Image-to-Text

Inference Providers

Deep Infra

Together AI

Bitdeer AI

Publisher

NVIDIA

rerank-qa-mistral-4b

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

Model

Ranking

Items per page

of 1 pages

322K

BAAI

Downloadable

bge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

Model

Embeddings

9.9M

llama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

Model

Image-Text Retrieval

1.38M

11mo

llama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

Model

Image-Text Retrieval

908K

11mo

NVIDIA

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

Model

Embedding

9.24M

9mo

NVIDIA

Downloadable

llama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

Model

nemo retriever

103K

9mo

NVIDIA

Free Endpoint

llama-3_2-nemoretriever-300m-embed-v1

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Model

Text-to-Embedding

344K

9mo

NVIDIA

Downloadable

llama-3_2-nemoretriever-300m-embed-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Model

Text-to-Embedding

6mo

NVIDIA

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Model

Text-to-Embedding

1.95M

1mo

NVIDIA

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

Model

nemo retriever

7.17M

2mo

NVIDIA

Downloadable

llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

Model

nemo retriever

2.52M

9mo

NVIDIA

Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

Model

nemo retriever

157K

11mo