Explore
Models
Blueprints
GPUs
Docs
?
Login
google
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Language Generation
Vision Assistant
Visual Question Answering
computer vision
cv
image
Image-to-Text
video
vlm
Get API Key
Experience
Model Card
API Reference
Accelerated by DGX Cloud
Sorry, your browser does not support inline SVG.