NVIDIA

Copyright © 2026 NVIDIA Corporation

9 results for

Filters (1)

  • Free Endpoint: 0
  • Partner Endpoint: 1
  • Download Available: 8
  • Developer Example: 1
  • Launchable: 0
  • Speech-to-Text: 7
  • Retrieval Augmented Generation: 0
  • Text-to-Embedding: 0
  • Together AI: 1
  • Deep Infra: 0
  • NVIDIA: 8
  • OpenAI: 1
  • Meta: 0
  • NVIDIA AI: 1
  • ASR
  • NVIDIA
    Downloadable

    conformer-ctc-asr

    Automatic speech recognition model that transcribes Spanish speech into lowercase text with record-setting accuracy and performance.
    Model
    ASR
    34
    1y
    NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    ASR
    71.47K
    11mo
    General
    Developer Example

    Nemotron Voice Agent

    Build Real-Time Voice Agents with NVIDIA Nemotron NIM.
    Blueprint
    Voice Agent
    2mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    1.39K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    130
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin-English transcriptions.
    Model
    ASR
    7.28K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Taiwanese Mandarin-English transcriptions.
    Model
    ASR
    1.57K
    7mo
    OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    77.68K
    1y
    NVIDIA
    Downloadable

    parakeet-tdt-0.6b-v2

    Accurate and optimized English transcriptions with punctuation and word timestamps.
    Model
    ASR
    49.29K
    9mo