NVIDIA
Explore
Models
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: Automatic Speech Translation
Sorting by Most Recent

nvidiaparakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps

ASREnglishNVIDIA NIMNVIDIA Rivaspeech-to-text

nvidiamagpie-tts-flow

Expressive and engaging text-to-speech, generated from a short audio sample.

TTSText-to-SpeechNVIDIA NIMNVIDIA Riva

nvidiariva-translate-4b-instruct

Translation model in 12 languages with few-shots example prompts capability.

Text Translationchat

nvidiariva-translate-1.6b

Enable smooth global interactions in 36 languages.

Text TranslationNeural machine translationNVIDIA NIM

googlegemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generationspeech recognitionVisual QAchat

googlegemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generationspeech recognitionVisual QAchat

nvidiaBackground Noise Removal

Removes unwanted noises from audio improving speech intelligibility.

Nvidia MaxineSpeech-to-speechDigital HumanSpeech Enhancement

nvidiamagpie-tts-zeroshot

Expressive and engaging text-to-speech, generated from a short audio sample.

TTSText-to-SpeechNVIDIA NIMNVIDIA Riva

googlegemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

TranslationchatText-to-TextLanguage Generation

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

Speech RecognitionVisual QAchatLanguage GenerationImage-to-TextChart and Table Understanding

openaiwhisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

ASRASTSpeech-to-TextbatchwhisperOpenAIMultilingualNVIDIA NIMNVIDIA Riva

nvidiacanary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Automatic Speech RecognitionAutomatic Speech TranslationNVIDIA NIMNVIDIA Riva

nvidiastudiovoice

Enhance speech by correcting common audio degradations to create studio quality speech output.

Nvidia MaxineSpeech-to-speechDigital HumanRun-on-RTXSpeech Enhancement

nvidiamegatron-1b-nmt

Enable smooth global interactions in 36 languages.

Text TranslationNeural machine translationNVIDIA NIM

thudmchatglm3-6b

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

Text TranslationchatCode GenerationText-to-TextRegional Language Generation