Efficient hybrid state-space model designed for conversational and reasoning tasks.
Cutting-edge vision-language model exceling in high-quality reasoning from images.
Cutting-edge vision-Language model exceling in high-quality reasoning from images.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
Vision foundation model capable of performing diverse computer vision and vision language tasks.
Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.
OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
A generative model of protein backbones for protein binder design.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Vision language model adept at comprehending text and visual inputs to produce informative responses
Generate images and stunning visuals with realistic aesthetics.
Groundbreaking multimodal model designed to understand and reason about visual elements in images.
One-shot visual language understanding model that translates images of plots into tables.
Multi-modal vision-language model that understands text/images and generates informative responses