microsoft /
PREVIEWkosmos-2
Groundbreaking multimodal model designed to understand and reason about visual elements in images.
AI models generate responses and outputs based on complex algorithms and machine learning techniques, and those responses or outputs may be inaccurate, harmful, biased or indecent. By testing this model, you assume the risk of any harm caused by any response or output of the model. Please do not upload any confidential information or personal data unless expressly permitted. Your use is logged for security purposes.
Input
Output
A young family is sitting in the grass with their dog.