NVIDIA

Copyright © 2026 NVIDIA Corporation

9 results for

    NVIDIA
    Downloadable

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    8.96K
    10mo
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    341K
    3mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    201K
    11mo
    NVIDIA
    Downloadable

    nemoretriever-parse

    Cutting-edge vision-language model excelling in retrieving text and metadata from images.
    Model
    optical character recognition
    85.79K
    1y
    NVIDIA
    Downloadable

    nemotron-parse

    Cutting-edge vision-language model excelling in retrieving text and metadata from images.
    Model
    text and table extraction
    218K
    7mo
    NVIDIA
    Downloadable

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    191
    11mo

    Use to call the VIOS REST API (sensor list, timelines, clip extraction, snapshots, add/delete sensors and streams). Not for VLM inference or search.
    Skill
    Developer
    453
    5d

    CLIP vision-language model for image-text retrieval, zero-shot classification, embedding extraction, ONNX export, and TensorRT deployment. Use when fine-tuning or training CLIP, running zero-shot classification, computing image embeddings, or deploying CL
    Skill
    AI Engineer
    263
    5d

    SegFormer for semantic segmentation. Lightweight transformer-based architecture with hierarchical feature extraction, efficient for real-time segmentation tasks. Use when training, evaluating, exporting, quantizing, or running inference for a TAO SegForme
    Skill
    Developer
    265
    5d