Explore
Models
Blueprints
GPUs
Docs
?
Login
microsoft
florence-2
PREVIEW
Vision foundation model capable of performing diverse computer vision and vision language tasks.
language generation
multimodal
vision assistant
visual question answering
computer vision
cv
image
image classification
image-to-text
object detection
text-to-image
vlm
Get API Key
This NIM has been deprecated