NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

23 results for

Filters

  • Free Endpoint
    9
  • Partner Endpoint
    18
  • Download Available
    11
  • Deep Infra
    12
  • Bitdeer AI
    8
  • GMI Cloud
    7
  • Together AI
    7
  • Lightning AI
    4
  • Mistral AI
    5
  • NVIDIA
    5
  • DeepSeek AI
    2
  • Minimaxai
    2
  • Moonshotai
    2
  • DGX Spark
    20 MINS

    CLI Coding Agent

    Build local CLI coding agents with Ollama
    Playbook
    Coding
    3d
    Items per page
    of 1 pages
    DGX Station
    30 MINS

    Local Coding Agent

    Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)
    Playbook
    Coding
    1mo
    DGX Spark
    30 MIN

    Vibe Coding in VS Code

    Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
    Playbook
    DGX
    6mo
    Mistral AI
    Downloadable

    mistral-medium-3.5-128b

    A high performing model for text generation, coding and agentic use cases
    Model
    coding
    1d
    Moonshotai
    Deprecation in 13dFree Endpoint

    kimi-k2-instruct

    State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities
    Model
    coding
    14.48M
    9mo
    Mistral AI
    Deprecation in 12dFree Endpoint

    magistral-small-2506

    High performance reasoning model optimized for efficiency and edge deployment
    Model
    coding
    1.17M
    9mo
    Stepfun-ai
    Free Endpoint

    step-3.5-flash

    200B open-source reasoning engine with sparse MoE powering frontier agentic AI.
    Model
    Agentic
    8.9M
    2mo
    DeepSeek AI
    Downloadable

    deepseek-v4-pro

    DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.
    Model
    Moe
    1.23M
    6d
    Google
    Downloadable

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    coding
    3.76M
    4w
    Z.ai
    Free Endpoint

    glm-4.7

    GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
    Model
    Tool Calling
    5.5M
    1w
    Z.ai
    Downloadable

    glm-5.1

    GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
    Model
    Agentic AI
    3.8M
    1w
    Qwen
    Free Endpoint

    qwen3-coder-480b-a35b-instruct

    Excels in agentic coding and browser use and supports 256K context, delivering top results.
    Model
    agentic coding
    3.21M
    8mo
    DeepSeek AI
    Downloadable

    deepseek-v4-flash

    DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.
    Model
    coding
    455K
    6d
    Minimaxai
    Deprecation in 13dDownloadable

    minimax-m2.5

    MiniMax M2.5 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    reasoning
    8.91M
    2mo
    Minimaxai
    Free Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    coding
    4.69M
    2w
    Mistral AI
    Deprecation in 12dFree Endpoint

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    2.55M
    4mo
    Moonshotai
    Deprecation in 6dFree Endpoint

    kimi-k2-instruct-0905

    Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.
    Model
    long-context
    8.63M
    7mo
    Sarvamai
    Downloadable

    sarvam-m

    Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.
    Model
    coding
    144K
    9mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    5.25M
    10mo
    Mistral AI
    Downloadable

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    Model
    code generation
    8.35M
    1mo
    Qwen
    Downloadable

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    Model
    tool calling
    7.55M
    1mo
    NVIDIA
    Downloadable

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    Model
    MoE
    9.28M
    4mo
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    MoE
    42.51M
    1mo