NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

8 results for

Filters (1)

  • API Endpoint
    6
  • Download Available
    2
  • Code Generation
    1
  • Retrieval Augmented Generation
    0
  • Text-to-Embedding
    0
  • Moonshotai
    2
  • Qwen
    2
  • NVIDIA
    1
  • ByteDance
    1
  • DeepSeek AI
    1
  • chat
  • DeepSeek AI

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    long context
    15.67M
    2mo
    Moonshotai

    kimi-k2-instruct-0905

    Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.
    Model
    long-context
    10.15M
    5mo
    Moonshotai

    kimi-k2-thinking

    Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.
    Model
    Conversational
    3.23M
    3mo
    Qwen

    qwen3-coder-480b-a35b-instruct

    Excels in agentic coding and browser use and supports 256K context, delivering top results.
    Model
    agentic coding
    3.92M
    6mo
    NVIDIA

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    Model
    MoE
    12.92M
    2mo
    Microsoft

    phi-3-small-128k-instruct

    Long context cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    613K
    9mo
    Qwen

    qwen3-next-80b-a3b-instruct

    Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.
    Model
    chat
    11.63M
    5mo
    ByteDance

    seed-oss-36b-instruct

    ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.
    Model
    thinking budget
    3.58M
    6mo
    Items per page
    of 1 pages