Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
microsoft
kosmos-2
Deprecated
API Endpoint
Groundbreaking multimodal model designed to understand and reason about visual elements in images.
Image Understanding
Multimodal
Visual Question Answering
computer vision
cv
image
Image-to-Text
video
vlm
Get API Key
This NIM has been deprecated