Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

6 results for

Filters (2)

API Endpoint

Download Available

Enterprise

Launchable

Use Case

Image-to-Text

Image Generation

Optical Character Recognition

Object Detection

Text-to-Image

Publisher

NVIDIA

Google

Qwen

Black forest labs

Microsoft

Blueprint Type

NVIDIA AI

NVIDIA Isaac GR00T

NVIDIA Omniverse

Labels (2)

image

VLM

Sort By

Qwen

qwen3.5-397b-a17b

Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.

Model

MoE

4.66M

NVIDIA

visual-changenet

Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask

Model

image

592

NVIDIA

cosmos-nemotron-34b

Multi-modal vision-language model that understands text/img/video and creates informative responses

Model

VLM

Google

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

Model

image

324K

NVIDIA

retail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

Model

Object Detection

778

NVIDIA

ocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

Model

Optical Character Recognition

785

Items per page

of 1 pages