Cutting-edge vision-language model exceling in retrieving text and metadata from images.
Industry leading jailbreak classification model for protection from adversarial attempts
Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Advanced AI model detects faces and identifies deep fake images.
Generates consistent characters across a series of images without requiring additional training.
Robust image classification model for detecting and managing AI-generated content.
Grounding dino is an open vocabulary zero-shot object detection model.
Creates diverse synthetic data that mimics the characteristics of real-world data.
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.