Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices.

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

4mo

Downloadable

Multimodal question-answering retrieval model that represents user queries as text and documents as images.

5mo

Free Endpoint

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

Downloadable

English text embedding model for question-answering retrieval.

16M

11mo
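
The models listed above are all embedding models for retrieval: a query and each document are mapped to vectors, and documents are ranked by how close their vectors are to the query vector. The sketch below illustrates only that ranking step, using cosine similarity over hand-written toy vectors. It does not call any NVIDIA NIM endpoint; the function names, document ids, and vector values are hypothetical and chosen purely for illustration.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as unrelated.
        return 0.0
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 3-dimensional "embeddings" (assumed values, not model output).
doc_vecs = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.1, 0.8, 0.3],
    "doc_c": [0.0, 0.2, 0.9],
}
query_vec = [0.8, 0.2, 0.1]

ranking = rank_documents(query_vec, doc_vecs)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

In a real deployment the vectors would come from one of the embedding models, and the same ranking logic would typically be delegated to a vector database rather than a Python loop.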