NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

3 results for

Filters (2)

  • Free Endpoint
    1
  • Partner Endpoint
    3
  • Download Available
    2
  • Code Generation
    0
  • Together AI
    3
  • Deep Infra
    3
  • CoreWeave
    2
  • Digital Ocean
    2
  • GMI Cloud
    2
  • OpenAI
    2
  • Qwen
    1
  • NVIDIA
    0
  • Microsoft
    0
  • Mistral AI
    0
  • Chat
  • reasoning
  • OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    39.68M
    8mo
    Qwen
    DeprecatedFree Endpoint

    qwq-32b

    Powerful reasoning model capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems.
    Model
    coding
    1.05M
    9mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    8.51M
    8mo
    Items per page
    of 1 pages