Build Your First RAG Pipeline

The getting started kit to connect chat-based reasoning models with your proprietary enterprise data

nvidia / nv-embed-v1PREVIEW

Generates high-quality numerical embeddings from text inputs.

Embeddings
Retrieval Augmented Generation
snowflake / arctic-embed-lPREVIEW

GPU-accelerated generation of text embeddings.

Embeddings
Retrieval Augmented Generation
nvidia / rerank-qa-mistral-4bPREVIEW

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

Ranking
Retrieval Augmented Generation
baai / bge-m3PREVIEW

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

Embeddings
Retrieval Augmented Generation