NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

11 results for

Filters

  • Download Available
    6
  • API Endpoint
    5
  • Enterprise
    0
  • Launchable
    0
  • Synthetic Data Generation
    5
  • Optical Character Recognition
    3
  • Digital Twin
    1
  • NVIDIA
    9
  • Baidu
    1
  • Microsoft
    1
  • NVIDIA AI
    0
  • NVIDIA

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Optical Character Recognition
    100K
    7mo
    NVIDIA

    nemoretriever-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Optical Character Recognition
    558K
    7mo
    NVIDIA

    llama-3.1-nemotron-nano-vl-8b-v1

    Multi-modal vision-language model that understands text/img and creates informative responses
    Model
    doc intelligence
    8.34M
    8mo
    Baidu

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    190K
    8mo
    NVIDIA

    ocdrnet

    OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.
    Model
    Optical Character Recognition
    757
    1y
    NVIDIA

    cosmos-predict1-5b

    Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.
    Model
    Synthetic Data Generation
    29.06K
    11mo
    NVIDIA

    cosmos-reason2-8b

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    Model
    video understanding
    221K
    2mo
    NVIDIA

    cosmos-transfer1-7b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Model
    Synthetic Data Generation
    15.85K
    8mo
    NVIDIA

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Model
    Synthetic Data Generation
    1w
    Microsoft

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    Model
    text-to-3d
    5.64K
    6mo
    NVIDIA

    usdsearch

    AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.
    Model
    OpenUSD
    553
    1y
    Items per page
    of 1 pages