Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
3 results for
Filters (2)
Models (2)
Blueprints (0)
Other (1)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
nvclip
NV-CLIP is a multimodal embeddings model for image and text.
Model
Computer vision
+3
10mo
Items per page
24
1
1
of 1 pages
39.62K
Google
Free Endpoint
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Model
image
+8
22.13K
1y
DGX Spark
1 HR
Vision-Language Model Fine-tuning
Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3
Playbook
DGX
+6
6mo