Skip to main content
Explore
Models
Skills
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
kosmos-2 Model by Microsoft | NVIDIA NIM
Microsoft
kosmos-2
Deprecated
Free Endpoint
Groundbreaking multimodal model designed to understand and reason about visual elements in images.
Image Understanding
Multimodal
Visual Question Answering
computer vision
cv
image
video
vlm
Get API Key
Experience
Experience
API Reference
API Reference
Accelerated by DGX Cloud
This NIM Endpoint has been deprecated
Please transition to another model to avoid any service interruptions.
For more models information, visit our
API Reference