NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
DiscoverModelsBlueprintsGPUsDocsForums

workstations

  • Run on RTX
  • Run on Spark
  • Run on Station

models

  • Reasoning
  • Vision
  • Visual Design
  • Retrieval
  • Speech
  • Biology
  • Simulation
  • Climate & Weather
  • Safety & Moderation

industries

  • Automotive
  • Financial Services
  • Gaming
  • Healthcare
  • Industrial
  • Robotics

Retrieval

Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for development
Accelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes

Embedding Models

Power enterprise search and question answering with NVIDIA Nemotron RAG models by embedding text and scanned documents, including PDFs and images, for fast, multilingual, multimodal data retrieval.

NVIDIA
Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.
embedding
2mo
NVIDIA
Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
NeMo Retriever
1mo

Extraction Models

Accelerate large-scale extraction from massive collections of multimodal data - text, images, and complex documents - with NVIDIA Nemotron RAG models for rapid, context-aware insights across your enterprise.

NVIDIA
Downloadable

nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Chart Detection
9mo
NVIDIA
Downloadable

nemoretriever-parse

Cutting-edge vision-language model exceling in retrieving text and metadata from images.
data ingestion
10mo
NVIDIA
Downloadable

nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Chart Detection
1y
NVIDIA
Deprecation in 27dDownloadable

nemoretriever-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Chart Detection
1y

nvidiaBuild an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

NIMNeMo RetrieverNemotronRetrieval-Augmented Generation

nvidiaBuild an AI Virtual Assistant

Create intelligent virtual assistants for customer service across every industry

Customer ServiceRetrieval-augmented generationcontact centerllm

nvidiaBuild a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

chatgenerative AIvideo-to-textvision
Enterprise

nvidiaNVIDIA AI-Q Blueprint for intelligent agents

AI agents that connect, retrieve, and reason on enterprise data—making information accessible, actionable, and intelligent.

AgentsEnterpriseNIMNeMoNemotron

crewaiCode Documentation for Software Development

Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.

AI AgentsCode DocumentationCrewAIPartner

Explore NVIDIA Blueprints

Accelerate AI application development with ready-to-use workflows powered by NVIDIA NIM and NeMo microservices for RAG, AI agents, video search and summarization, and more.

Reranking Models

Improve information retrieval accuracy with world-class NVIDIA Nemotron RAG models by reranking retrieved enterprise data to improve answer relevancy.

NVIDIA
Downloadable

llama-nemotron-rerank-vl-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
3w
NVIDIA
Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nemo retriever
1mo