Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
8 results for
Filters
Models (6)
Blueprints (1)
Other (1)
Sort By
score:DESC
Best Match
NVIDIA
Launchable
Enterprise
Build a Video Search and Summarization (VSS) Agent
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
Blueprint
vision
+6
2w
NVIDIA
cosmos-reason1-7b
Reasoning vision language model (VLM) for physical AI and robotics.
Model
video understanding
+8
15.93K
6mo
NVIDIA
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Model
video understanding
+8
194K
2mo
NVIDIA
ocdrnet
OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.
Model
Optical Character Recognition
+7
798
1y
Google
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Model
image
+8
327K
1y
NVIDIA
retail-object-detection
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
Model
Object Detection
+7
794
1y
DGX Spark
1 HR
Vision-Language Model Fine-tuning
Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3
Playbook
DGX
+6
5mo
NVIDIA
visual-changenet
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
Model
image
+8
615
1y
Items per page
24
1
1
of 1 pages