NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

4 results for

Filters

  • Partner Endpoint
    3
  • Download Available
    3
  • Speech-to-Text
    1
  • Deep Infra
    3
  • Together AI
    3
  • CoreWeave
    2
  • Digital Ocean
    2
  • GMI Cloud
    2
  • OpenAI
    3
  • NVIDIA
    1
  • OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    69.83K
    1y
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    36.76M
    8mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    7.75M
    8mo
    DGX Spark
    30 MIN

    Run models with llama.cpp on DGX Spark

    Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Gemma 4 31B IT as example)
    Playbook
    DGX Spark
    1w
    Items per page
    of 1 pages