Generate Embeddings for Text Retrieval

The best embedding models to connect chat-based LLMs with your proprietary enterprise data

nvidia / embed-qa-4PREVIEW

GPU-accelerated generation of text embeddings used for question-answering retrieval.

Embeddings
Retrieval Augmented Generation
nvidia / nv-embed-v1PREVIEW

Generates high-quality numerical embeddings from text inputs.

Embeddings
Retrieval Augmented Generation
snowflake / arctic-embed-lPREVIEW

GPU-accelerated generation of text embeddings.

Embeddings
Retrieval Augmented Generation
baai / bge-m3PREVIEW

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

Embeddings
Retrieval Augmented Generation

Reranking Models

Identify the right chunks of data from your diverse business data to improve accuracy of responses

new york
PREVIEW
nvidia
rerank-qa-mistral-4b
ranking
retrieval augmented generation
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.