NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: Speech Enhancement
Sorting by Most Recent

nvidiaparakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps

ASREnglishNVIDIA NIMNVIDIA Rivaspeech-to-text

nvidiamagpie-tts-flow

Expressive and engaging text-to-speech, generated from a short audio sample.

TTSText-to-SpeechNVIDIA NIMNVIDIA Riva

googlegemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generationspeech recognitionVisual QAchat

googlegemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generationspeech recognitionVisual QAchat

nvidiaBackground Noise Removal

Removes unwanted noises from audio improving speech intelligibility.

Nvidia MaxineSpeech-to-speechDigital HumanSpeech Enhancement

nvidiamagpie-tts-zeroshot

Expressive and engaging text-to-speech, generated from a short audio sample.

TTSText-to-SpeechNVIDIA NIMNVIDIA Riva

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

Speech RecognitionVisual QAchatLanguage GenerationImage-to-TextChart and Table Understanding

openaiwhisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

ASRASTSpeech-to-TextbatchwhisperOpenAIMultilingualNVIDIA NIMNVIDIA Riva

nvidiacanary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Automatic Speech RecognitionAutomatic Speech TranslationNVIDIA NIMNVIDIA Riva

nvidiaPDF to Podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

blueprintMulti-modalLaunchableText-to-SpeechConversational AIPDF-to-PodcastNVIDIA AIAI Agent

nvidiastudiovoice

Enhance speech by correcting common audio degradations to create studio quality speech output.

Nvidia MaxineSpeech-to-speechDigital HumanRun-on-RTXSpeech Enhancement