Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Cutting-edge vision-language model excelling in retrieving text and metadata from images.
Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
Connect AI applications to multimodal enterprise data with a scalable retrieval-augmented generation (RAG) pipeline built on high-performance NIM microservices, for faster PDF data extraction and more accurate information retrieval.
Context-aware chart extraction model that detects 18 classes of basic chart elements, excluding plot elements.
Highly efficient Mixture of Experts model for RAG, summarization, entity extraction, and classification.
Advanced text-to-image model for generating high-quality images.
Generate images and stunning visuals with realistic aesthetics.
Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.