NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

34 results for

Filters

  • Free Endpoint
    4
  • Partner Endpoint
    7
  • Download Available
    18
  • Launchable
    6
  • Developer Example
    5
  • Enterprise Blueprint
    2
  • Speech-to-Text
    8
  • Code Generation
    3
  • Optical Character Recognition
    3
  • Image-to-Text
    2
  • Digital Twin
    1
  • Deep Infra
    6
  • Together AI
    2
  • Bitdeer AI
    1
  • GMI Cloud
    1
  • NVIDIA
    26
  • Meta
    3
  • Cyborg
    1
  • DeepSeek AI
    1
  • Google
    1
  • NVIDIA AI
    5
  • NVIDIA Omniverse
    1
  • OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    Items per page
    of 2 pages
    77.68K
    1y
    NVIDIA
    Downloadable

    conformer-ctc-asr

    Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance
    Model
    ASR
    34
    1y
    NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    ASR
    71.47K
    11mo
    NVIDIA
    Downloadable

    canary-1b-asr

    Multi-lingual model supporting speech-to-text recognition and translation.
    Model
    Automatic Speech Recognition
    28.35K
    1y
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Model
    Automatic Speech Recognition
    16.6K
    2mo
    NVIDIA
    Downloadable

    parakeet-1.1b-rnnt-multilingual-asr

    High accuracy and optimized performance for transcription in 25 languages
    Model
    Automatic Speech Recognition
    26.6K
    1y
    General
    Developer Example

    Nemotron Voice Agent

    Build Real-Time Voice Agents with NVIDIA Nemotron NIM.
    Blueprint
    Voice Agent
    2mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
    Model
    ASR
    1.39K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    130
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin English transcriptions.
    Model
    ASR
    7.28K
    8mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
    Model
    ASR
    1.57K
    7mo
    DeepSeek AI
    Downloadable

    deepseek-v4-flash

    DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.
    Model
    coding
    12.88M
    1mo
    NVIDIA
    Downloadable

    parakeet-tdt-0.6b-v2

    Accurate and optimized English transcriptions with punctuation and word timestamps
    Model
    ASR
    49.29K
    10mo
    Healthcare & Life Sciences
    LaunchableDeveloper Example

    Ambient Healthcare Agents

    Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
    Blueprint
    NVIDIA AI
    3mo
    General
    LaunchableEnterprise

    Build an Enterprise RAG Pipeline Blueprint

    Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
    Blueprint
    NVIDIA AI
    3mo
    Cyborg
    Deprecation in 25dLaunchable

    Cyborg Enterprise RAG

    Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.
    Blueprint
    NIM
    3mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    39.54K409K
    1y
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    23.31K1.34M
    1y
    Meta
    Free Endpoint

    llama-guard-4-12b

    Multi-modal model to classify safety for input prompts as well output responses.
    Model
    LLM Multimodal Safety
    138K
    11mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    7.15M
    3mo
    General
    LaunchableDeveloper Example

    LLM Router

    Route LLM requests to the best model for the task at hand.
    Blueprint
    NVIDIA AI
    3mo
    DGX Station
    30 MINS

    Local Coding Agent

    Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)
    Playbook
    Coding
    2mo
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    Model
    language generation
    4.08M
    5mo
    NVIDIA
    Downloadable

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    14.25K
    10mo