A general-purpose multimodal, multilingual 128-expert MoE model with 17B parameters.
A multimodal, multilingual 16-expert MoE model with 17B parameters.
Cutting-edge open multimodal model excelling in high-quality reasoning from images.
Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
Connect AI applications to multimodal enterprise data with a scalable retrieval-augmented generation (RAG) pipeline built on high-performance NIM microservices, for faster PDF data extraction and more accurate information retrieval.
Cutting-edge open multimodal model excelling in high-quality reasoning from images.
Vision foundation model capable of performing diverse computer vision and vision-language tasks.
Cutting-edge open multimodal model excelling in high-quality reasoning from images.