Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Publisher
Use Case
NIM Type
Sorting by Most Recent

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

Efficiently refine retrieval results over multiple sources and languages.

World-class multilingual and cross-lingual question-answering retrieval.

Cutting-edge vision-language model exceling in high-quality reasoning from images.

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

English text embedding model for question-answering retrieval.

Multilingual text question-answering retrieval, transforming textual information into dense vector representations.

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

GPU-accelerated generation of text embeddings used for question-answering retrieval.

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.