Speech
Automatic Speech Recognition (ASR)
Connect generative AI models to speech by transcribing spoken audio to text.
Convert Text to Speech (TTS)
Voice generative AI models by converting written text to spoken audio.
Run Anywhere
nvidiamagpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
Run Anywhere
nvidiafastpitch-hifigan-tts
Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots
Neural Machine Translation (NMT) & Audio Speech Translation (AST)
Create multilingual generative AI models by translating speech and text between languages.
Speech Enhancement
Speech enhancing AI models for common voice degradations.
Run Anywhere
nvidiastudiovoice
Enhance speech by correcting common audio degradations to create studio quality speech output.