Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

llama-nemotron-rerank-vl-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

nemo retriever

70.59K

1mo

NVIDIA

Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

nemo retriever

168K

2mo

NVIDIA

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

37.05M

2mo

NVIDIA

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

nemo retriever

6.24M

2mo

NVIDIA

Downloadable

llama-3_2-nemoretriever-300m-embed-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

7mo

NVIDIA

Free Endpoint

llama-3_2-nemoretriever-300m-embed-v1

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

650K

9mo

NVIDIA

Downloadable

llama-3.2-nemoretriever-500m-rerank-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

nemo retriever

21.18K

10mo

NVIDIA

Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

nemo retriever

138K

11mo

NVIDIA

Downloadable

llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

nemo retriever

2.47M

9mo

NVIDIA

Downloadable

llama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

nemo retriever

136K

9mo

NVIDIA

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

Embedding

21.01M

9mo

NVIDIA

Free Endpoint

nv-embed-v1

Generates high-quality numerical embeddings from text inputs.

Non-Commercial Use Only

3.05M

9mo

BAAI

Downloadable

bge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

Embeddings

3.89M

NVIDIA

Free Endpoint

rerank-qa-mistral-4b

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

Ranking

339K
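
The embedding and reranking models listed above are typically combined in two stages: embeddings narrow a corpus to a few candidates, then a reranker scores each candidate passage against the question. The following is a minimal sketch of that flow in plain Python, not any of these models' actual APIs. The function names (`retrieve`, `rerank`), the toy vectors, and the use of a sigmoid to turn a raw reranker logit into a probability-like score are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query_vec, doc_vecs, k):
    """Stage 1 (hypothetical helper): indices of the k most similar documents."""
    ranked = sorted(
        range(len(doc_vecs)),
        key=lambda i: cosine_similarity(query_vec, doc_vecs[i]),
        reverse=True,
    )
    return ranked[:k]


def sigmoid(logit):
    """Numerically stable logistic function."""
    if logit >= 0:
        return 1.0 / (1.0 + math.exp(-logit))
    e = math.exp(logit)
    return e / (1.0 + e)


def rerank(candidates, logits):
    """Stage 2 (hypothetical helper): sort candidates by score, highest first.

    Assumes the reranker returns one raw logit per candidate; the sigmoid
    mapping to a 0..1 score is an assumption, not a documented behavior.
    """
    scored = [(idx, sigmoid(logit)) for idx, logit in zip(candidates, logits)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data standing in for real model outputs.
query = [1.0, 0.0]
docs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
top = retrieve(query, docs, k=2)
ordered = rerank(top, logits=[-1.0, 2.0])
```

In practice the vectors and logits would come from whichever embedding and reranking model is deployed; only the retrieve-then-rerank ordering logic is shown here.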