Search Results
Searching for: Automatic Speech Recognition
Transform PDFs into AI podcasts for engaging on-the-go audio content.
Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
Enhance speech by correcting common audio degradations to create studio quality speech output.
Create intelligent, interactive avatars for customer service across industries.
Natural, high-fidelity English voices for personalizing text-to-speech services and voiceovers.
Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots.
Record-setting accuracy and performance for English transcription.
State-of-the-art accuracy and speed for English transcriptions.
OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition, respectively.