NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

28 results for

Filters (2)

  • Free Endpoint
    2
  • Partner Endpoint
    2
  • Download Available
    21
  • Launchable
    5
  • Enterprise
    1
  • Retrieval Augmented Generation
    11
  • Text-to-Embedding
    6
  • Object Detection
    6
  • Optical Character Recognition
    3
  • Drug Discovery
    0
  • Deep Infra
    2
  • Together AI
    0
  • Bitdeer AI
    0
  • GMI Cloud
    0
  • CoreWeave
    0
  • NVIDIA
    26
  • Baidu
    1
  • Cyborg
    1
  • Meta
    0
  • Mistral AI
    0
  • NVIDIA AI
    4
  • NVIDIA Omniverse
    0
  • NVIDIA BioNemo
    0
  • NVIDIA Isaac GR00T
    0
  • A100 SXM4 80GB
    0
  • B200
    0
  • GB200
    0
  • GH200 144G HBM3e
    0
  • H100 80GB HBM3
    0
  • NeMo
  • nemo retriever
  • NVIDIA
    LaunchableEnterprise

    Build an Enterprise RAG Pipeline Blueprint

    Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
    Blueprint
    NVIDIA AI
    2mo
    Items per page
    of 2 pages
    Cyborg
    Launchable

    Cyborg Enterprise RAG

    Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.
    Blueprint
    NIM
    2mo
    NVIDIA
    Downloadable

    llama-3.2-nemoretriever-500m-rerank-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    Model
    nemo retriever
    21.18K
    10mo
    NVIDIA
    Downloadable

    llama-3.2-nv-embedqa-1b-v2

    Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
    Model
    nemo retriever
    2.47M
    9mo
    NVIDIA
    Downloadable

    llama-3.2-nv-rerankqa-1b-v2

    Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
    Model
    nemo retriever
    136K
    9mo
    NVIDIA
    Free Endpoint

    llama-3_2-nemoretriever-300m-embed-v1

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Text-to-Embedding
    650K
    9mo
    NVIDIA
    Downloadable

    llama-3_2-nemoretriever-300m-embed-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Text-to-Embedding
    97
    7mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Text-to-Embedding
    37.05M
    2mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    6.24M
    2mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    Model
    nemo retriever
    168K
    2mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-vl-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    Model
    nemo retriever
    70.59K
    1mo
    NVIDIA
    Launchable

    Multi-Agent Intelligent Warehouse

    An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.
    Blueprint
    NVIDIA AI
    2mo
    NVIDIA
    Downloadable

    nemoretriever-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    4.29K
    1y
    NVIDIA
    Downloadable

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    7.28K
    9mo
    NVIDIA
    Downloadable

    nemoretriever-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    2.65M
    9mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v2

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    126K
    1y
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    7.27K
    4mo
    NVIDIA
    Downloadable

    nemoretriever-parse

    Cutting-edge vision-language model exceling in retrieving text and metadata from images.
    Model
    optical character recognition
    124K
    11mo
    NVIDIA
    Downloadable

    nemoretriever-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    24.81K
    1y
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    20.87K
    2mo
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    932K
    1mo
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    17.89K
    2mo
    NVIDIA
    Free Endpoint

    nv-embedcode-7b-v1

    The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
    Model
    nemo retriever
    138K
    11mo
    NVIDIA
    Downloadable

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Model
    Embedding
    21.01M
    9mo