NVIDIA
Copyright © 2025 NVIDIA Corporation

google

paligemma

PREVIEW

Vision language model adept at comprehending text and visual inputs to produce informative responses

Tags: language generation, vision assistant, visual question answering, computer vision, cv, image, image-to-text, video, vlm
Accelerated by DGX Cloud