NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

15 results for

Filters

  • Download Available
    6
  • API Endpoint
    5
  • Enterprise
    1
  • Launchable
    1
  • Synthetic Data Generation
    5
  • Optical Character Recognition
    3
  • Digital Twin
    1
  • NVIDIA
    13
  • Baidu
    1
  • Microsoft
    1
  • NVIDIA AI
    1
  • NVIDIA

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Optical Character Recognition
    69.61K
    7mo
    NVIDIA

    nemoretriever-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Optical Character Recognition
    557K
    7mo
    NVIDIA

    llama-3.1-nemotron-nano-vl-8b-v1

    Multi-modal vision-language model that understands text/img and creates informative responses
    Model
    doc intelligence
    7.93M
    8mo
    Baidu

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    189K
    8mo
    NVIDIA

    ocdrnet

    OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.
    Model
    Optical Character Recognition
    771
    1y
    NVIDIA
    LaunchableEnterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    vision
    2w
    NVIDIA

    cosmos-predict1-5b

    Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.
    Model
    Synthetic Data Generation
    27.35K
    11mo
    NVIDIA

    cosmos-reason2-8b

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    Model
    video understanding
    205K
    2mo
    NVIDIA

    cosmos-transfer1-7b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Model
    Synthetic Data Generation
    15.84K
    8mo
    NVIDIA

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Model
    Synthetic Data Generation
    1w
    Microsoft

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    Model
    text-to-3d
    5.66K
    6mo
    NVIDIA

    usdsearch

    AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.
    Model
    OpenUSD
    541
    1y
    DGX Spark
    30 MIN

    Vibe Coding in VS Code

    Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
    Playbook
    DGX
    5mo
    DGX Spark

    Vibe Coding in VS Code

    Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue.dev
    Playbook
    DGX
    5mo
    DGX Spark
    5 MIN

    VS Code

    Install and use VS Code locally or remotely
    Playbook
    DGX
    5mo
    Items per page
    of 1 pages