NVIDIA
Explore
Models
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: Automatic Speech Translation
Sorting by Most Recent

nvidiaparakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps

nvidiamagpie-tts-flow

Expressive and engaging text-to-speech, generated from a short audio sample.

nvidiariva-translate-4b-instruct

Translation model in 12 languages with few-shots example prompts capability.

nvidiariva-translate-1.6b

Enable smooth global interactions in 36 languages.

googlegemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

googlegemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

nvidiaBackground Noise Removal

Removes unwanted noises from audio improving speech intelligibility.

nvidiamagpie-tts-zeroshot

Expressive and engaging text-to-speech, generated from a short audio sample.

googlegemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

openaiwhisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

nvidiacanary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

nvidiaPDF to Podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

nvidiastudiovoice

Enhance speech by correcting common audio degradations to create studio quality speech output.

nvidiamegatron-1b-nmt

Enable smooth global interactions in 36 languages.

thudmchatglm3-6b

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.