NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

17 results for

Filters

  • Free Endpoint
    1
  • Partner Endpoint
    3
  • Download Available
    12
  • Developer Example
    2
  • Launchable
    1
  • Speech-to-Text
    8
  • Retrieval Augmented Generation
    1
  • Text-to-Embedding
    1
  • Deep Infra
    3
  • Together AI
    1
  • NVIDIA
    15
  • Meta
    1
  • OpenAI
    1
  • NVIDIA AI
    2
  • NVIDIA
    Downloadable

    conformer-ctc-asr

    Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance
    Model
    ASR
    34
    1y
    Items per page
    of 1 pages
    NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    ASR
    71.47K
    11mo
    NVIDIA
    Downloadable

    canary-1b-asr

    Multi-lingual model supporting speech-to-text recognition and translation.
    Model
    Automatic Speech Recognition
    28.35K
    1y
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Model
    Automatic Speech Recognition
    16.6K
    2mo
    NVIDIA
    Downloadable

    parakeet-1.1b-rnnt-multilingual-asr

    High accuracy and optimized performance for transcription in 25 languages
    Model
    Automatic Speech Recognition
    26.6K
    1y
    General
    Developer Example

    Nemotron Voice Agent

    Build Real-Time Voice Agents with NVIDIA Nemotron NIM.
    Blueprint
    Voice Agent
    2mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
    Model
    ASR
    1.39K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    130
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin English transcriptions.
    Model
    ASR
    7.28K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
    Model
    ASR
    1.57K
    7mo
    OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    77.68K
    1y
    NVIDIA
    Downloadable

    parakeet-tdt-0.6b-v2

    Accurate and optimized English transcriptions with punctuation and word timestamps
    Model
    ASR
    49.29K
    10mo
    Healthcare & Life Sciences
    LaunchableDeveloper Example

    Ambient Healthcare Agents

    Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
    Blueprint
    NVIDIA AI
    3mo
    Meta
    Free Endpoint

    llama-guard-4-12b

    Multi-modal model to classify safety for input prompts as well output responses.
    Model
    LLM Multimodal Safety
    138K
    11mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    7.15M
    3mo
    DGX Spark
    30 MIN

    Run models with llama.cpp on DGX Spark

    Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
    Playbook
    DGX Spark
    1mo
    DGX Spark
    30 MIN

    Vibe Coding in VS Code

    Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
    Playbook
    DGX
    7mo