
StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

Stable Diffusion 3.5 is a popular text-to-image generation model

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

Industry leading jailbreak classification model for protection from adversarial attempts

Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Advanced AI model detects faces and identifies deep fake images.

Robust image classification model for detecting and managing AI-generated content.

Grounding dino is an open vocabulary zero-shot object detection model.

Advanced text-to-image model for generating high quality images

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.