microsoft/phi-4-multimodal-instruct

PREVIEW

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

I can evaluate images and discuss them with you!