NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

3 results for

Filters (1)

  • Free Endpoint
    2
  • Partner Endpoint
    2
  • Download Available
    1
  • Launchable
    0
  • Enterprise
    0
  • Code Generation
    0
  • Drug Discovery
    0
  • Retrieval Augmented Generation
    0
  • Image-to-Text
    0
  • Object Detection
    0
  • Fireworks AI
    2
  • Deep Infra
    1
  • Together AI
    1
  • GMI Cloud
    1
  • Bitdeer AI
    1
  • Microsoft
    1
  • Qwen
    1
  • ByteDance
    1
  • NVIDIA
    0
  • Meta
    0
  • NVIDIA AI
    0
  • NVIDIA Omniverse
    0
  • NVIDIA BioNemo
    0
  • NVIDIA Isaac GR00T
    0
  • text-generation
  • Microsoft
    Free Endpoint

    phi-4-mini-flash-reasoning

    Lightweight reasoning model for applications in latency bound, memory/compute constrained environments
    Model
    chat
    215K
    8mo
    Qwen
    Downloadable

    qwen3-next-80b-a3b-instruct

    Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.
    Model
    chat
    20.32M
    6mo
    ByteDance
    Free Endpoint

    seed-oss-36b-instruct

    ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.
    Model
    chat
    2.27M
    7mo
    Items per page
    of 1 pages