
nvidia
vila
DeprecatedAPI EndpointMulti-modal vision-language model that understands text/img/video and creates informative responses
This NIM has been deprecated

Multi-modal vision-language model that understands text/img/video and creates informative responses