Cutting-edge vision-language model exceling in retrieving text and metadata from images.
GOVERNING TERMS: Access to this model is governed by the NVIDIA API Trial Terms of Service; use of the model is governed by the NVIDIA Community Model License. Use of the tokenizer included in this model is governed by the CC-BY-4.0 license.