Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

24 results for

Filters

  • Free Endpoint
    17
  • Partner Endpoint
    15
  • Download Available
    14
  • Developer Example
    1
  • Image-to-Text
    1
  • OpenRouter
    13
  • Deepinfra
    11
  • GMI Cloud
    8
  • Together AI
    8
  • Bitdeer
    6
  • NVIDIA
    10
  • Mistral AI
    3
  • DeepSeek AI
    2
  • Minimaxai
    2
  • Stepfun ai
    2
  • AI Engineer
    2
  • Developer
    2
  • Application Developer
    1
  • DevOps Engineer
    1
  • Ml Engineer
    1
  • Developer Tools
    2
  • B200
    5
  • H200
    4
  • H100 80GB HBM3
    3
  • GB200
    1
  • L40S
    1
  • NeMo RL
    1
  • NeMoClaw
    1
  • DGX Spark
    20 MINS

    CLI Coding Agent

    Build local CLI coding agents with Ollama
    Playbook
    Coding
    2mo
    Items per page
    of 1 pages
    DGX Station
    30 MINS

    Local Coding Agent

    Run local CLI coding agents with Claude Code and Ollama on DGX Station (NVIDIA GB300) using qwen3.6:27b
    Playbook
    Coding
    3mo
    DGX Spark
    30 MIN

    Vibe Coding in VS Code

    Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
    Playbook
    DGX
    8mo
    DGX Station
    15 MIN

    DGX Station AI Skills for Coding Agents

    Give your coding agent (Claude Code, Codex, Gemini CLI, Cursor) DGX Station expertise via an AGENTS.md and on-demand Agent Skills
    Playbook
    vLLM
    1mo
    Mistral AI
    DownloadableFree Endpoint

    mistral-medium-3.5-128b

    A high performing model for text generation, coding and agentic use cases
    Model
    coding
    4M
    2mo
    Stepfun-ai
    Free Endpoint

    step-3.5-flash

    200B open-source reasoning engine with sparse MoE powering frontier agentic AI.
    Model
    Agentic
    12M
    4mo
    DeepSeek AI
    DownloadableFree Endpoint

    deepseek-v4-pro

    DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.
    Model
    Moe
    8M
    2mo
    Z.ai
    DownloadableFree Endpoint

    glm-5.1

    GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
    Model
    Agentic AI
    32M
    2mo
    Stepfun-ai
    DownloadableFree Endpoint

    step-3.7-flash

    A sparse MoE multimodal reasoning model good for enterprise, agentic and coding tasks.
    Model
    Coding
    4M
    1mo
    Minimaxai
    Free Endpoint

    minimax-m3

    MiniMax M3 Preview is a multimodal MoE vision-language model with strong reasoning, coding, and tool-calling capabilities.
    Model
    coding
    6M
    17d
    Sarvamai
    DownloadableFree Endpoint

    sarvam-m

    Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.
    Model
    coding
    236K
    11mo
    Google
    DownloadableFree Endpoint

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    reasoning
    5M
    2mo
    DeepSeek AI
    DownloadableFree Endpoint

    deepseek-v4-flash

    DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.
    Model
    MoE
    15M
    2mo
    Minimaxai
    DownloadableFree Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    reasoning
    14M
    2mo
    Moonshotai
    DownloadableFree Endpoint

    kimi-k2.6

    1T multimodal MoE for long-horizon coding, agentic tool use, and image/video understanding.
    Model
    Multimodal
    15M
    1mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    1M
    1y
    Mistral AI
    DownloadableFree Endpoint

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    Model
    code generation
    13M
    3mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    Model
    tool calling
    10M
    3mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    Model
    MoE
    13M
    6mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    MoE
    60M
    3mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-ultra-550b-a55b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    Agent
    8M
    25d

    Manage durable working-session memory for coding agents. Use when a user asks to preserve or recover agent context across disconnects, VS Code restarts, long-running work, handoffs, or any session where important state should be written periodically under
    Skill
    Developer
    860
    25d

    Guides human users' AI agents to the NemoClaw docs MCP server and canonical Fern documentation in Markdown form. Use when users ask how to install, configure, operate, troubleshoot, secure, or learn NemoClaw with an AI coding assistant. Trigger keywords -
    Skill
    Developer
    160
    3d
    General
    Developer Example

    Nsight Copilot - AI Code Assistant for CUDA Development

    Deploy an AI-powered coding assistant on DGX Spark that delivers expert CUDA-aware chat, real-time code completion, and retrieval-augmented generation grounded in authoritative GPU programming knowledge—powered by NVIDIA NIM microservices.
    Blueprint
    19d