Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

10 results for

Filters

  • Download Available
    8
  • Developer Example
    1
  • Launchable
    1
  • Speech-to-Text
    7
  • NVIDIA
    10
  • NVIDIA AI
    1
  • A100 SXM4 80GB
    6
  • DGX Spark
    2
  • H100 80GB HBM3
    2
  • L40S
    2
  • Media
    LaunchableDeveloper Example

    Streaming Data to RAG

    Sensor-captured radio enables real-time awareness, AI-driven analytics for actionable, searchable insights.
    Blueprint
    NVIDIA AI
    3mo
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Model
    Automatic Speech Recognition
    Items per page
    of 1 pages
    9.44K
    2mo
    NVIDIA
    Downloadable

    conformer-ctc-asr

    Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance
    Model
    ASR
    17
    1y
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
    Model
    Streaming
    1.45K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    Streaming
    134
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin English transcriptions.
    Model
    Streaming
    9.88K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
    Model
    Streaming
    1.5K
    7mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    2.62K
    11mo
    NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    Streaming
    60K
    11mo
    DGX Spark
    20 MIN

    Live VLM WebUI

    Real-time Vision Language Model interaction with webcam streaming
    Playbook
    Vision AI
    5mo