Sorting by Most Recent

State-of-the-art accuracy and speed for English transcriptions.

Enhance speech by correcting common audio degradations to create studio quality speech output.

Natural, high-fidelity, English voices for personalizing text-to-speech services and voiceovers

Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots

Record-setting accuracy and performance for English transcription.

State-of-the-art accuracy and speed for English transcriptions.

nvidia/ocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.