⌘KCtrl+K

Your Privacy Choices

Contact

Explore

Models

⌘KCtrl+K

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

API Endpoint

Download Available

Use Case

Object Detection

Image-to-Text

Image Generation

Optical Character Recognition

Image-to-Embedding

Publisher

NVIDIA

Google

nv-dinov2

NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.

computer vision

1.18M

11mo

NVIDIA

API Endpoint

nv-grounding-dino

Grounding dino is an open vocabulary zero-shot object detection model.

Object Detection

3.6K

11mo

NVIDIA

Downloadable

nvclip

NV-CLIP is a multimodal embeddings model for image and text.

Computer vision

23.65K

9mo

NVIDIA

API Endpoint

ocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

Optical Character Recognition

736

NVIDIA

API Endpoint

visual-changenet

Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask

image

640

NVIDIA

API Endpoint

retail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

Object Detection

363

Google

API Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

image

335K

Items per page

of 1 pages