
Copyright © 2026 NVIDIA Corporation

58 results for

    DGX Spark
    1 HR

    Build and Deploy a Multi-Agent Chatbot

    Deploy a multi-agent chatbot system and chat with agents on your Spark
    Playbook
    DGX
    7mo
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    Model
    English
    2.72K
    1mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small generative language model for edge applications
    Model
    Chat
    676K
    11mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    30.28M
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.14M
    1y
    NVIDIA
    Free Endpoint

    usdcode

    State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
    Model
    Digital Twin
    10mo
    NVIDIA
    Launchable, Enterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    NVIDIA AI
    2mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    2.74M
    10mo
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.
    Model
    reasoning
    36.6M
    9mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    17.38M
    9mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    3.44M
    11mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    30.18K · 464K
    11mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    19.49K · 1.28M
    11mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    Chat
    373K
    11mo
    Mistral AI
    Deprecation in 9d, Downloadable

    mixtral-8x22b-instruct-v0.1

    An MoE LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    2.32M
    9mo
    Mistral AI
    Downloadable

    mixtral-8x7b-instruct-v0.1

    An MoE LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    757K
    9mo
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    MoE
    54.2M
    2mo
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments
    Model
    Chat
    605K
    11mo
    Upstage
    Free Endpoint

    solar-10.7b-instruct

    Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
    Model
    Non-Commercial Use Only
    390K
    1y
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    30.95M
    10mo
    NVIDIA
    Downloadable

    llama-3.3-nemotron-super-49b-v1

    High-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    math
    4.06M
    9mo
    NVIDIA
    Downloadable

    llama-3.3-nemotron-super-49b-v1.5

    High-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    math
    3.32M
    9mo
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general-purpose VLM ideal for chat and instruction-based use cases
    Model
    language generation
    2.63M
    5mo
    DGX Spark
    15 MIN

    Open WebUI with Ollama

    Install Open WebUI and use Ollama to chat with models on your Spark
    Playbook
    DGX
    7mo