microsoft
phi-4-multimodal-instruct
PREVIEW
Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
chart and table understanding
language generation
speech recognition
visual qa
image-to-text
Accepted file types: .png, .jpg, .jpeg, .mp3, .wav