Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
nvidia
vila
Deprecated
Free Endpoint
Multi-modal vision-language model that understands text/img/video and creates informative responses
VLM
Vision language model
image caption
image to text
Get API Key
Experience
Experience
Model Card
Model Card
API Reference
API Reference
Accelerated by DGX Cloud
This NIM Endpoint has been deprecated