nvidia

vila

DeprecatedAPI Endpoint

Multi-modal vision-language model that understands text/img/video and creates informative responses

This NIM has been deprecated