
Copyright © 2026 NVIDIA Corporation

nvidia / vila

Deprecated, API Endpoint

Multi-modal vision-language model that understands text, images, and video and generates informative responses.

VLM, Vision language model, image caption, image to text
This NIM has been deprecated