NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

4 results for

Filters

  • Partner Endpoint
    3
  • Download Available
    3
  • Speech-to-Text
    1
  • Deep Infra
    3
  • CoreWeave
    2
  • Digital Ocean
    2
  • GMI Cloud
    2
  • Lightning AI
    2
  • OpenAI
    3
  • NVIDIA
    1
  • OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    Items per page
    of 1 pages
    45.82K
    1y
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    32.44M
    9mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    14.08M
    9mo
    DGX Spark
    30 MIN

    Run models with llama.cpp on DGX Spark

    Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
    Playbook
    DGX Spark
    1mo