NVIDIA

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for development
Accelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes

Retrieval

Copyright © 2025 NVIDIA Corporation

Reranking Models

Improve retrieval accuracy and answer relevance with world-class NVIDIA NeMo Retriever models that rerank retrieved enterprise data.
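
As a rough illustration of where a reranker sits in a retrieval pipeline, the sketch below takes the candidates returned by a first-stage search and reorders them with a second scoring pass. The `overlap_score` function is a toy stand-in for a reranking model, and every name here is hypothetical; none of it is the NeMo Retriever API.

```python
from typing import Callable, List, Tuple


def overlap_score(query: str, passage: str) -> float:
    """Toy relevance score: fraction of query terms found in the passage.

    Assumption: this stands in for a real reranking model, which would
    score each (query, passage) pair with a neural network instead.
    """
    query_terms = set(query.lower().split())
    passage_terms = set(passage.lower().split())
    if not query_terms:
        return 0.0
    return len(query_terms & passage_terms) / len(query_terms)


def rerank(
    query: str,
    candidates: List[str],
    scorer: Callable[[str, str], float] = overlap_score,
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Score every candidate against the query and keep the best top_k."""
    scored = [(passage, scorer(query, passage)) for passage in candidates]
    # Highest score first; Python's sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical first-stage results, in the order the retriever returned them.
candidates = [
    "GPUs accelerate deep learning training",
    "How to reset a forgotten password",
    "Steps to reset your account password safely",
]
for passage, score in rerank("reset account password", candidates, top_k=2):
    print(f"{score:.2f}  {passage}")
```

In this toy run, the passage that covers all three query terms moves to the top, even though the first-stage search returned it last.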

Run Anywhere

nvidia / llama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

Deprecation in 70 days

nvidia / nv-rerankqa-mistral-4b-v3

Multilingual text reranking model.

Embedding Models

Power enterprise search and question answering with NVIDIA NeMo Retriever models—embed text and scanned documents, including PDFs and images, for fast, multilingual, multimodal data retrieval.
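
To make the idea of embedding-based retrieval concrete, here is a minimal sketch that ranks documents by cosine similarity between a query vector and document vectors. The three-dimensional vectors are hand-written placeholders for what an embedding model would produce, and the file names and function names are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(
    query_vec: Sequence[float],
    index: Dict[str, Sequence[float]],
    top_k: int = 2,
) -> List[str]:
    """Return the ids of the top_k documents most similar to the query."""
    ranked = sorted(
        index,
        key=lambda doc_id: cosine_similarity(query_vec, index[doc_id]),
        reverse=True,
    )
    return ranked[:top_k]


# Assumption: placeholder vectors; a real embedding model would return
# vectors with hundreds or thousands of dimensions.
index = {
    "invoice.pdf": [0.9, 0.1, 0.0],
    "handbook.txt": [0.1, 0.8, 0.3],
    "floorplan.png": [0.0, 0.2, 0.9],
}
query_vec = [0.2, 0.9, 0.2]

print(search(query_vec, index))  # → ['handbook.txt', 'floorplan.png']
```

Because text, PDFs, and images all end up as vectors in the same space, one similarity function can rank them together, which is what makes multimodal retrieval possible.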

PREVIEW

nvidia / llama-3_2-nemoretriever-300m-embed-v1

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

PREVIEW

nvidia / llama-3.2-nemoretriever-1b-vlm-embed-v1

Multimodal question-answer retrieval representing user queries as text and documents as images.

Run Anywhere

nvidia / llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

Deprecation in 70 days

nvidia / nv-embedqa-mistral-7b-v2

Multilingual text question-answering retrieval, transforming textual information into dense vector representations.

Run Anywhere

nvidia / nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

Deprecation in 70 days

snowflake / arctic-embed-l

Optimized community model for text embedding.

Explore NVIDIA Blueprints

Accelerate AI application development with ready-to-use workflows powered by NVIDIA NIM and NeMo microservices for RAG, AI agents, video search and summarization, and more.

Enterprise

nvidia / Build an Enterprise RAG pipeline

Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.

nvidia / Build an AI Virtual Assistant

Create intelligent virtual assistants for customer service across every industry.

Enterprise

nvidia / Build a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A.

Enterprise

nvidia / Build an AI Agent for Enterprise Research

Build a custom deep researcher powered by state-of-the-art models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.

llamaindex / Document Research Assistant for Blog Creation

Automate research and generate blogs with AI agents using LlamaIndex and the Llama3.3-70B NIM LLM.

crewai / Code Documentation for Software Development

Document your GitHub repositories with AI agents using CrewAI and the Llama3.3 70B NIM.

Extraction Models

Accelerate large-scale extraction from massive collections of multimodal data (text, images, and complex documents) for rapid, context-aware insights across your enterprise.

Run Anywhere

nvidia / nemoretriever-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Run Anywhere

nvidia / nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Run Anywhere

nvidia / nemoretriever-parse

Cutting-edge vision-language model excelling in retrieving text and metadata from images.

Run Anywhere

nvidia / nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Deprecated

google / deplot

Translate images of plots into tables with one-shot visual language understanding.

Run Anywhere

nvidia / nemoretriever-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Run Anywhere

nvidia / nemoretriever-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.