Multi-modal model to classify safety for input prompts as well output responses.
Multimodal question-answer retrieval representing user queries as text and documents as images.
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
A multimodal, multilingual 16 MoE model with 17B parameters.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Vision foundation model capable of performing diverse computer vision and vision language tasks.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.