
Cutting-edge vision-language model exceling in retrieving text and metadata from images.

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Japanese-specialized large-language-model for enterprises to read and understand complex business documents.

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Multimodal question-answer retrieval representing user queries as text and documents as images.

Multi-modal vision-language model that understands text/img and creates informative responses

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.