Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA | Launch from Hugging Face (Beta)

Nemo retriever: 25 models

    NVIDIA
    Downloadable

    llama-nemotron-rerank-vl-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    659
    2w
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    1.18M
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    111K
    1mo
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    18.46K
    1mo
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    18.25K
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    1.37M
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    nemo retriever
    8.28M
    2mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    78.21K
    3mo
    NVIDIA
    Downloadable

    llama-3_2-nemoretriever-300m-embed-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    146
    6mo
    NVIDIA
    Downloadable

    nemoretriever-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    169K
    8mo
    NVIDIA
    Free Endpoint

    llama-3_2-nemoretriever-300m-embed-v1

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    272K
    8mo
    NVIDIA
    Downloadable

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    23.48K
    8mo
    NVIDIA
    Downloadable

    llama-3.2-nemoretriever-500m-rerank-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    6.87K
    9mo
    NVIDIA
    Deprecation in 8d
    Downloadable

    llama-3.2-nemoretriever-1b-vlm-embed-v1

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    nemo retriever
    200K
    9mo
    NVIDIA
    Free Endpoint

    nv-embedcode-7b-v1

    The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
    nemo retriever
    201K
    10mo
    NVIDIA
    Downloadable

    nemoretriever-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    27.28K
    1y
    NVIDIA
    Downloadable

    nemoretriever-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    4.71K
    1y
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v2

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    70.4K
    1y
    NVIDIA
    Downloadable

    nemoretriever-parse

    Cutting-edge vision-language model excelling in retrieving text and metadata from images.
    optical character recognition
    59.73K
    10mo
    NVIDIA
    Downloadable

    llama-3.2-nv-embedqa-1b-v2

    Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
    nemo retriever
    2.4M
    8mo
    NVIDIA
    Downloadable

    llama-3.2-nv-rerankqa-1b-v2

    Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
    nemo retriever
    175K
    8mo
    University at Buffalo
    Free Endpoint

    cached

    Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.
    nemo retriever
    156
    1y
    NVIDIA
    Downloadable

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    2.2K
    9mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Optical Character Recognition
    277K
    9mo