⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (2)

Free Endpoint

1

Partner Endpoint

1

Download Available

2

Use Case

Retrieval Augmented Generation

3

Text-to-Embedding

2

Drug Discovery

0

Image-to-Text

0

Code Generation

0

Inference Providers

Deep Infra

1

Together AI

0

GMI Cloud

0

Bitdeer AI

0

CoreWeave

0

Publisher

NVIDIA

3

Meta

0

Mistral AI

0

Qwen

0

Google

0

NIM Container GPUs

A100 SXM4 80GB

0

B200

0

GB200

0

GH200 144G HBM3e

0

H100 80GB HBM3

0

Labels (2)

Embedding

nemo retriever

3 models

Sort By

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

Items per page

of 1 pages

6.85M

3mo

Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

206K

11mo

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

32.08M

10mo