NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • Free Endpoint
    2
  • Partner Endpoint
    2
  • Download Available
    22
  • Retrieval Augmented Generation
    12
  • Text-to-Embedding
    7
  • Object Detection
    6
  • Optical Character Recognition
    3
  • Code Generation
    0
  • Deep Infra
    2
  • Together AI
    0
  • Bitdeer AI
    0
  • GMI Cloud
    0
  • CoreWeave
    0
  • NVIDIA
    23
  • Baidu
    1
  • Mistral AI
    0
  • Meta
    0
  • Microsoft
    0
  • Enterprise
    0
  • NVIDIA BioNemo
    0
  • nemo retriever
  • 24 models
    NVIDIA
    Downloadable

    llama-nemotron-rerank-vl-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    2.8K
    2w
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    1.7M
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    177K
    1mo
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    19.33K
    1mo
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    18.6K
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    1.86M
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    nemo retriever
    9.31M
    2mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    75.53K
    4mo
    NVIDIA
    Downloadable

    llama-3_2-nemoretriever-300m-embed-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    137
    6mo
    NVIDIA
    Downloadable

    nemoretriever-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    687K
    8mo
    NVIDIA
    Free Endpoint

    llama-3_2-nemoretriever-300m-embed-v1

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    361K
    8mo
    NVIDIA
    Downloadable

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    20.2K
    8mo
    NVIDIA
    Downloadable

    llama-3.2-nemoretriever-500m-rerank-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    7.09K
    9mo
    NVIDIA
    Deprecation in 5dDownloadable

    llama-3.2-nemoretriever-1b-vlm-embed-v1

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    nemo retriever
    194K
    9mo
    NVIDIA
    Free Endpoint

    nv-embedcode-7b-v1

    The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
    nemo retriever
    206K
    10mo
    NVIDIA
    Downloadable

    nemoretriever-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    27.14K
    1y
    NVIDIA
    Downloadable

    nemoretriever-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    4.63K
    1y
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v2

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    70.81K
    1y
    NVIDIA
    Downloadable

    nemoretriever-parse

    Cutting-edge vision-language model exceling in retrieving text and metadata from images.
    optical character recognition
    105K
    10mo
    NVIDIA
    Downloadable

    llama-3.2-nv-embedqa-1b-v2

    Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
    nemo retriever
    2.37M
    9mo
    NVIDIA
    Downloadable

    llama-3.2-nv-rerankqa-1b-v2

    Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
    nemo retriever
    178K
    9mo
    NVIDIA
    Downloadable

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    1.28K
    9mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Optical Character Recognition
    789K
    9mo
    NVIDIA
    Downloadable

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Embedding
    12.15M
    9mo
    Items per page
    of 1 pages