NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

4 results for

Filters

  • Partner Endpoint
    3
  • Download Available
    3
  • Speech-to-Text
    1
  • Deep Infra
    3
  • Together AI
    3
  • CoreWeave
    2
  • Digital Ocean
    2
  • GMI Cloud
    2
  • OpenAI
    3
  • NVIDIA
    1
  • OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    Items per page
    of 1 pages
    52.74K
    1y
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    35.93M
    9mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    16.1M
    9mo
    DGX Spark
    30 MIN

    Run models with llama.cpp on DGX Spark

    Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
    Playbook
    DGX Spark
    1mo