Copyright © 2026 NVIDIA Corporation

20 results for

Filters

  • Free Endpoint (2)
  • Partner Endpoint (3)
  • Download Available (14)
  • Launchable (1)
  • Speech-to-Text (8)
  • Retrieval Augmented Generation (2)
  • Text-to-Embedding (2)
  • Deep Infra (3)
  • Together AI (2)
  • NVIDIA (16)
  • Baidu (1)
  • Meta (1)
  • OpenAI (1)
  • Speakleash (1)
  • NVIDIA AI (2)
    NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    ASR
    49.84K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    8.58K
    9mo
    NVIDIA
    Downloadable

    canary-1b-asr

    Multi-lingual model supporting speech-to-text recognition and translation.
    Model
    Automatic Speech Recognition
    5.48K
    11mo
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English.
    Model
    Automatic Speech Recognition
    331
    4d
    NVIDIA
    Downloadable

    parakeet-1.1b-rnnt-multilingual-asr

    High accuracy and optimized performance for transcription in 25 languages.
    Model
    Automatic Speech Recognition
    31.2K
    10mo
    NVIDIA

    Nemotron Voice Agent

    A voice agent that uses the Nemotron model to generate responses to voice commands.
    Blueprint
    Voice Agent
    2w
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    10
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    742
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin-English transcriptions.
    Model
    ASR
    8.09K
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Taiwanese Mandarin-English transcriptions.
    Model
    ASR
    364
    5mo
    OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    52.35K
    11mo
    NVIDIA
    Downloadable

    parakeet-tdt-0.6b-v2

    Accurate and optimized English transcriptions with punctuation and word timestamps.
    Model
    ASR
    2.57K
    7mo
    NVIDIA
    Launchable

    Ambient Healthcare Agents

    Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM.
    Blueprint
    NVIDIA AI
    4w
    Speakleash
    Free Endpoint

    bielik-11b-v2.6-instruct

    State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
    Model
    chat
    583K
    6mo
    NVIDIA
    Downloadable

    llama-3.2-nemoretriever-1b-vlm-embed-v1

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    259K
    8mo
    Meta
    Free Endpoint

    llama-guard-4-12b

    Multi-modal model to classify safety for input prompts as well as output responses.
    Model
    LLM Multimodal Safety
    498K
    8mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    1.21M
    1mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    85.68K
    8mo
    DGX Spark
    30 MIN

    Vibe Coding in VS Code

    Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue.
    Playbook
    DGX
    5mo
    DGX Spark

    Vibe Coding in VS Code

    Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue.dev
    Playbook
    DGX
    5mo