Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA

Launch from Hugging Face (Beta)

Text-to-Embedding

9 models

Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

1.86M

1mo

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

9.31M

2mo

Downloadable

llama-3_2-nemoretriever-300m-embed-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

137

6mo

Free Endpoint

llama-3_2-nemoretriever-300m-embed-v1

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Text-to-Embedding

361K

8mo

Deprecation in 5d

Downloadable

llama-3.2-nemoretriever-1b-vlm-embed-v1

Multimodal question-answer retrieval representing user queries as text and documents as images.

194K

9mo

Downloadable

llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

2.37M

9mo

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

12.15M

9mo

Free Endpoint

nv-embed-v1

Generates high-quality numerical embeddings from text inputs.

Non-Commercial Use Only

4.48M

9mo

Downloadable

bge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

11.99M

1y
