NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

21 results for

Filters

  • Download Available
    10
  • Launchable
    7
  • Enterprise
    3
  • API Endpoint
    3
  • Retrieval Augmented Generation
    11
  • Text-to-Embedding
    8
  • Image-to-Text
    2
  • NVIDIA
    17
  • Meta
    2
  • Cyborg
    1
  • BAAI
    1
  • NVIDIA AI
    6
  • NVIDIA

    rerank-qa-mistral-4b

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    Model
    Ranking
    140K
    1y
    BAAI

    bge-m3

    Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.
    Model
    Embeddings
    1.84M
    10mo
    NVIDIA
    Launchable

    Biomedical AI-Q Research Agent Blueprint

    Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint
    Blueprint
    Launchable
    3w
    Cyborg
    Launchable

    Cyborg Enterprise RAG

    Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.
    Blueprint
    NIM
    3w
    NVIDIA
    Launchable

    Multi-Agent Intelligent Warehouse

    An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.
    Blueprint
    nemo retriever
    3w
    NVIDIA
    Launchable

    Retail Shopping Assistant

    Elevate Shopping Experiences Online and In Stores.
    Blueprint
    nemo retriever
    3w
    NVIDIA
    LaunchableEnterprise

    Build an AI Agent for Enterprise Research

    Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.
    Blueprint
    NIM
    3w
    NVIDIA
    Launchable

    Build an AI Virtual Assistant

    Create intelligent virtual assistants for customer service across every industry
    Blueprint
    Customer Service
    3w
    NVIDIA
    LaunchableEnterprise

    Build an Enterprise RAG Pipeline Blueprint

    Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
    Blueprint
    NIM
    3w
    Meta

    llama-3.2-11b-vision-instruct

    Cutting-edge vision-language model exceling in high-quality reasoning from images.
    Model
    Image-Text Retrieval
    750K
    9mo
    Meta

    llama-3.2-90b-vision-instruct

    Cutting-edge vision-Language model exceling in high-quality reasoning from images.
    Model
    Image-Text Retrieval
    607K
    9mo
    NVIDIA

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Model
    Embedding
    3.21M
    7mo
    NVIDIA

    llama-3.2-nemoretriever-1b-vlm-embed-v1

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    269K
    8mo
    NVIDIA

    llama-3.2-nv-rerankqa-1b-v2

    Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
    Model
    nemo retriever
    165K
    7mo
    NVIDIA

    llama-3_2-nemoretriever-300m-embed-v1

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Retrieval Augmented Generation
    86.67K
    7mo
    NVIDIA

    llama-3_2-nemoretriever-300m-embed-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Retrieval Augmented Generation
    129K
    5mo
    NVIDIA

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Retrieval Augmented Generation
    282K
    1w
    NVIDIA

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    769K
    4w
    NVIDIA
    Enterprise

    Cosmos Dataset Search

    Accelerate post-training of end-to-end autonomous vehicle stacks with vector search and retrieval for large video datasets.
    Blueprint
    Autonomous Vehicles
    3w
    NVIDIA

    llama-3.2-nv-embedqa-1b-v2

    Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
    Model
    nemo retriever
    6.82M
    7mo
    NVIDIA

    nv-embedcode-7b-v1

    The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
    Model
    nemo retriever
    255K
    9mo
    Items per page
    of 1 pages