Cutting-edge vision-language model exceling in retrieving text and metadata from images.
GOVERNING TERMS: Access to this model is governed by the NVIDIA API Trial Terms of Service; use of the model is governed by the NVIDIA Community Model License.