Multimodal vision-language model that understands text and images and generates informative responses.
Governing Terms: Your use of the service is governed by the NVIDIA API Catalog Terms of Service. Your use of the model is governed by the NVIDIA Open License Agreement. Additional Information: Llama 3.1 Community Model License; Built with Llama.