Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Qwen

qwen3.5-397b-a17b

Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.

chat

8.02M

NVIDIA

API Endpoint

cosmos-nemotron-34b

Multimodal vision-language model that understands text, images, and video and generates informative responses

VLM

NVIDIA

API Endpoint

ocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

Optical Character Recognition

736

NVIDIA

API Endpoint

visual-changenet

Visual ChangeNet compares two images, detects pixel-level changes, and outputs a semantic change segmentation mask

image

640

NVIDIA

API Endpoint

retail-object-detection

EfficientDet-based object detection network that detects 100 specific retail objects in an input video.

Object Detection

363

Google

API Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

image

335K
