microsoft/phi-4-multimodal-instruct
PREVIEW
Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
chart and table understanding
language generation
speech recognition
visual qa
image-to-text