Cutting-edge vision-language model excelling in retrieving text and metadata from images.
Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
Context-aware chart extraction that detects 18 classes of basic chart elements, excluding plot elements.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Highly efficient Mixture of Experts model for RAG, summarization, entity extraction, and classification.
Ingest and extract highly accurate insights contained in text, graphs, charts, and tables within massive volumes of PDF documents.