NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

21 results for

Filters

  • API Endpoint
    12
  • Download Available
    5
  • Launchable
    4
  • Image-to-Text
    1
  • NVIDIA
    4
  • Mistral AI
    3
  • Qwen
    3
  • DeepSeek AI
    2
  • Moonshotai
    2
  • NVIDIA AI
    4
  • NVIDIA
    Launchable

    Safety for Agentic AI

    Improve safety, security, and privacy of AI systems at build, deploy and run stages.
    Blueprint
    security
    2w
    Weights & Biases
    Launchable

    Traceability for Agentic AI

    Trace and evaluate AI Agents with Weights & Biases.
    Blueprint
    Traceability
    2w
    Z.ai

    glm5

    GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.
    Model
    MoE
    7.94M
    3w
    Stepfun-ai

    step-3.5-flash

    200B open-source reasoning engine with sparse MoE powering frontier agentic AI.
    Model
    Agentic
    7.29M
    1mo
    Minimaxai

    minimax-m2.1

    MiniMax M2.1 excels in multi-language coding, app/web dev, office AI, and agent integration
    Model
    Agentic
    8.38M
    1mo
    Qwen

    qwen3-next-80b-a3b-instruct

    Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.
    Model
    chat
    11.15M
    5mo
    Moonshotai

    kimi-k2-instruct

    State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities
    Model
    coding
    20.23M
    7mo
    Qwen

    qwen3-coder-480b-a35b-instruct

    Excels in agentic coding and browser use and supports 256K context, delivering top results.
    Model
    agentic coding
    3.83M
    6mo
    Qwen

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    Model
    MoE
    6.55M
    3w
    Mistral AI

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    Model
    language generation
    6.17M
    3mo
    DeepSeek AI

    deepseek-v3.1-terminus

    DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
    Model
    tool calling
    13.01M
    5mo
    Mistral AI

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    5.78M
    3mo
    Moonshotai

    kimi-k2-instruct-0905

    Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.
    Model
    long-context
    10.04M
    5mo
    Viavi
    Launchable

    Intent-Driven RAN Energy Efficiency Blueprint

    Build a closed-loop agentic workflow for energy optimization.
    Blueprint
    nim
    1w
    DeepSeek AI

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    long context
    15.64M
    2mo
    Z.ai

    glm4.7

    GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
    Model
    Tool Calling
    17.75M
    1mo
    NVIDIA

    llama-3.1-nemotron-nano-8b-v1

    Leading reasoning and agentic AI accuracy model for PC and edge.
    Model
    chat
    606K
    8mo
    Mistral AI

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    721K
    9mo
    NVIDIA

    nvidia-nemotron-nano-9b-v2

    High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
    Model
    thinking budget
    753K
    6mo
    ByteDance

    seed-oss-36b-instruct

    ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.
    Model
    thinking budget
    3.46M
    6mo
    NVIDIA
    Launchable

    AI Agent for Telecom Network Configuration Planning

    Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.
    Blueprint
    nim
    2w
    Items per page
    of 1 pages