NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

16 results for

Filters

  • Free Endpoint
    2
  • Partner Endpoint
    3
  • Download Available
    14
  • Launchable
    0
  • Speech-to-Text
    8
  • Retrieval Augmented Generation
    2
  • Text-to-Embedding
    2
  • Deep Infra
    3
  • Together AI
    2
  • NVIDIA
    12
  • Baidu
    1
  • Meta
    1
  • OpenAI
    1
  • Speakleash
    1
  • NVIDIA AI
    0
  • NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    ASR
    49.84K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    8.58K
    9mo
    NVIDIA
    Downloadable

    canary-1b-asr

    Multi-lingual model supporting speech-to-text recognition and translation.
    Model
    Automatic Speech Recognition
    5.48K
    11mo
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Model
    Automatic Speech Recognition
    331
    4d
    NVIDIA
    Downloadable

    parakeet-1.1b-rnnt-multilingual-asr

    High accuracy and optimized performance for transcription in 25 languages
    Model
    Automatic Speech Recognition
    31.2K
    10mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
    Model
    ASR
    10
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    742
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin English transcriptions.
    Model
    ASR
    8.09K
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
    Model
    ASR
    364
    5mo
    OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    52.35K
    11mo
    NVIDIA
    Downloadable

    parakeet-tdt-0.6b-v2

    Accurate and optimized English transcriptions with punctuation and word timestamps
    Model
    ASR
    2.57K
    7mo
    Speakleash
    Free Endpoint

    bielik-11b-v2.6-instruct

    State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
    Model
    chat
    583K
    6mo
    NVIDIA
    Downloadable

    llama-3.2-nemoretriever-1b-vlm-embed-v1

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    259K
    8mo
    Meta
    Free Endpoint

    llama-guard-4-12b

    Multi-modal model to classify safety for input prompts as well output responses.
    Model
    LLM Multimodal Safety
    498K
    8mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    1.21M
    1mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    85.68K
    8mo
    Items per page
    of 1 pages