
nvidia
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Input
Your question or task. Aim for up to 400 tokens (300 words); max 1000 tokens. Model can accommodate reasoning or non-reasoning answers. Enable reasoning by including this text string in the user prompt: Answer the question using the following format:<think>Your reasoning.</think> Write your final answer immediately after the </think> tag
Defines AI role/rules for session. Max 250 tokens.
You may upload videos for the Cosmos Reason 2 trial experience. NVIDIA will use and store those videos to provide you with this trial experience. Please do not upload videos containing confidential information or personal data. Your use is logged for security, fraud or abuse monitoring. For more information about our data processing practices, see our Privacy Policy. By clicking “Run” you consent to our collection and use of uploaded videos and you agree to the NVIDIA API Trial Service Terms of Use. Your use of the model is governed by NVIDIA Open Models License.