Search Results

Searching for: Automatic Speech Recognition
Sorting by Most Recent

Transform PDFs into AI podcasts for engaging on-the-go audio content.

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Enhance speech by correcting common audio degradations to create studio quality speech output.

Create intelligent, interactive avatars for customer service across industries.

Natural, high-fidelity English voices for personalizing text-to-speech services and voiceovers.

Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots.

Record-setting accuracy and performance for English transcription.

State-of-the-art accuracy and speed for English transcriptions.

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition, respectively.