Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
Object detection model fine-tuned to detect charts, tables, and titles in documents.
Ingest massive volumes of PDF documents and extract highly accurate insights from their text, graphs, charts, and tables.