Copyright © 2026 NVIDIA Corporation
Vision language model adept at comprehending text and visual inputs to produce informative responses