NVIDIA
Explore Models Blueprints GPUs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: ASR
Sorting by Most Recent

nvidiaparakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

asrstreamingspeech-to-textmultilingualnvidia nimnvidia

openaiwhisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

asrastspeech-to-textbatchwhisperopenaimultilingualnvidia nimnvidia rivaopenai

nvidiacanary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

asraststreamingspeech-to-textbatchspanishmultilingualnvidia nimnvidia rivanvidia

nvidiacanary-0.6b-turbo-asr

Multi-lingual model supporting speech-to-text recognition and translation.

asrastfastspeech-to-textbatchmultilingualnvidia nimnvidia rivanvidia

baidupaddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

optical character recognitiontable extractionoptical character detectionnemo retrieverrun on rtxdata ingestionextractionbaidu

nvidiaconformer-ctc-asr

Automatic speech recognition model that transcribes speech in lower case English with record-setting accuracy and performance

asrstreamingspeech-to-textspanishnvidia nimnvidia rivanvidia

nvidiaparakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

asrstreamingenglishspeech-to-textbatchnvidia nimnvidia

nvidiaparakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

asrstreamingenglishbatchrun on rtxspeech-to-textfastnvidia nimnvidia

stabilityaistable-video-diffusion

Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.

image generationtext-to-imagestabilityai