Stable Diffusion 3.5 is a popular text-to-image generation model
Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.
Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Cutting-edge vision-language model exceling in retrieving text and metadata from images.
Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Advanced text-to-image model for generating high quality images