NVIDIA

Copyright © 2026 NVIDIA Corporation

28 results for

Filters

  • Free Endpoint
    4
  • Partner Endpoint
    9
  • Download Available
    24
  • Speech-to-Text
    9
  • Code Generation
    4
  • Optical Character Recognition
    3
  • Image-to-Text
    2
  • Digital Twin
    1
  • Deep Infra
    7
  • Together AI
    3
  • GMI Cloud
    2
  • Bitdeer AI
    1
  • NVIDIA
    16
  • Meta
    4
  • DeepSeek AI
    1
  • Google
    1
  • Mistral AI
    1
  • OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    77.68K
    1y
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    2.84K
    11mo
    NVIDIA
    Downloadable

    conformer-ctc-asr

    Automatic speech recognition model that transcribes speech in lowercase Spanish with record-setting accuracy and performance.
    Model
    ASR
    34
    1y
    NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    ASR
    71.47K
    11mo
    NVIDIA
    Downloadable

    canary-1b-asr

    Multi-lingual model supporting speech-to-text recognition and translation.
    Model
    Automatic Speech Recognition
    28.35K
    1y
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Model
    Automatic Speech Recognition
    16.6K
    2mo
    NVIDIA
    Downloadable

    parakeet-1.1b-rnnt-multilingual-asr

    High accuracy and optimized performance for transcription in 25 languages
    Model
    Automatic Speech Recognition
    26.6K
    1y
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    1.39K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    130
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin-English transcriptions.
    Model
    ASR
    7.28K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
    Model
    ASR
    1.57K
    7mo
    DeepSeek AI
    Downloadable

    deepseek-v4-flash

    DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.
    Model
    coding
    12.88M
    1mo
    NVIDIA
    Downloadable

    parakeet-tdt-0.6b-v2

    Accurate and optimized English transcriptions with punctuation and word timestamps
    Model
    ASR
    49.29K
    10mo
    Black-forest-labs
    Downloadable

    FLUX.1-dev

    FLUX.1 is a state-of-the-art suite of image generation models
    Model
    Text-to-Image
    147K
    11mo
    Black-forest-labs
    Downloadable

    FLUX.1-schnell

    FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds
    Model
    Text-to-Image
    177K
    11mo
    Black-forest-labs
    Downloadable

    flux.2-klein-4b

    FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lightning speed
    Model
    image editing
    277K
    2mo
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    35.29M
    10mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    39.54K 409K
    1y
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    23.31K 1.34M
    1y
    Meta
    Free Endpoint

    llama-guard-4-12b

    Multi-modal model to classify safety for input prompts as well as output responses.
    Model
    LLM Multimodal Safety
    138K
    11mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    7.15M
    3mo
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    Model
    language generation
    4.08M
    5mo
    NVIDIA
    Downloadable

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    14.25K
    10mo
    NVIDIA
    Downloadable

    nemoretriever-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    2.07M
    9mo