nvidia
vila
PREVIEWMulti-modal vision-language model that understands text/img/video and creates informative responses
AI models generate responses and outputs based on complex algorithms and machine learning techniques, and those responses or outputs may be inaccurate, harmful, biased or indecent. By testing this model, you assume the risk of any harm caused by any response or output of the model. Please do not upload any confidential information or personal data unless expressly permitted. Your use is logged for security purposes.
Input
You may upload images for the VILA trial experience. NVIDIA will only use and store those images to provide you with this trial experience. Please do not upload images containing confidential information or personal data. Your use is logged for security purposes. For more information about our data processing practices, see our Privacy Policy. By clicking “Run” you consent to our collection and use of uploaded images and you agree to the NVIDIA API Trial Service Terms of Use. ADDITIONAL INFORMATION: For the NVIDIA Vila model: BigVision project model: Apache 2.0 license; and Yi Series Model Yi-34B: Apache 2.0 license.