NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

6 results for

Filters (2)

  • API Endpoint
    5
  • Download Available
    1
  • Enterprise
    0
  • Launchable
    0
  • Image-to-Text
    1
  • Image Generation
    1
  • Optical Character Recognition
    1
  • Object Detection
    1
  • Text-to-Image
    0
  • NVIDIA
    4
  • Google
    1
  • Qwen
    1
  • Black forest labs
    0
  • Microsoft
    0
  • NVIDIA AI
    0
  • NVIDIA Isaac GR00T
    0
  • NVIDIA Omniverse
    0
  • image
  • VLM
  • Qwen

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    Model
    MoE
    4.66M
    2w
    NVIDIA

    visual-changenet

    Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
    Model
    image
    592
    1y
    NVIDIA

    cosmos-nemotron-34b

    Multi-modal vision-language model that understands text/img/video and creates informative responses
    Model
    VLM
    6
    1y
    Google

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    Model
    image
    324K
    1y
    NVIDIA

    retail-object-detection

    EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
    Model
    Object Detection
    778
    1y
    NVIDIA

    ocdrnet

    OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.
    Model
    Optical Character Recognition
    785
    1y
    Items per page
    of 1 pages