SAM 2 is a segmentation model that enables fast, precise selection of any object in any video or image.
Context-aware chart extraction model that detects 18 classes of basic chart elements, excluding plot elements.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model for table extraction that receives an image as input, runs OCR on it, and returns the detected text along with the bounding boxes of that text.
Advanced AI model that detects faces and identifies deepfake images.
Robust image classification model for detecting and managing AI-generated content.
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
Grounding DINO is an open-vocabulary, zero-shot object detection model.
OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition, respectively.
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.