Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

62 results for

Filters

  • Free Endpoint
    28
  • Partner Endpoint
    17
  • Download Available
    34
  • Launchable
    4
  • Developer Example
    3
  • Enterprise Blueprint
    3
  • NemoClaw Blueprint
    1
  • Code Generation
    7
  • Image-to-Text
    4
  • Object Detection
    4
  • Drug Discovery
    3
  • Retrieval Augmented Generation
    3
  • Deepinfra
    12
  • Together AI
    8
  • CoreWeave
    6
  • GMI Cloud
    6
  • Bitdeer
    5
  • NVIDIA
    39
  • Meta
    4
  • Google
    3
  • Microsoft
    3
  • Mistral AI
    3
  • Developer
    6
  • Application Developer
    3
  • AI Engineer
    2
  • DevOps Engineer
    2
  • Solutions Architect
    2
  • NVIDIA AI
    5
  • AI And Machine Learning
    2
  • Accelerated Computing
    2
  • Developer Tools
    1
  • Physical AI
    1
  • L40S
    11
  • A100 SXM4 80GB
    10
  • B200
    10
  • H100 80GB HBM3
    10
  • A100 PG509 200
    8
  • Dynamo
    1
  • NeMo RL
    1
  • NeMoClaw
    1
  • Omniverse
    1
  • cuOpt
    1
  • Resemble.AI
    Downloadable

    chatterbox-multilingual-tts

    Natural and expressive voices in 23 languages. For voice agents and brand ambassadors.
    Model
    TTS
    Items per page
    of 3 pages
    7.21K
    7d
    DGX Spark
    1 HR

    Build and Deploy a Multi-Agent Chatbot

    Deploy a multi-agent chatbot system and chat with agents on your Spark
    Playbook
    DGX
    8mo
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    Model
    English
    1.77K
    2mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    1.42M
    1y
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    33.75M
    11mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.53M
    1y
    General
    LaunchableEnterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    NVIDIA AI
    3mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    3.79M
    11mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    MoE
    60.41M
    3mo
    Microsoft
    DownloadableFree Endpoint

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Model
    Chat
    445K
    1y
    Upstage
    Free Endpoint

    solar-10.7b-instruct

    Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
    Model
    Non-Commercial Use Only
    449K
    1y
    OpenAI
    DownloadableFree Endpoint

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    57.12M
    10mo
    OpenAI
    DownloadableFree Endpoint

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    18.45M
    10mo
    Meta
    DownloadableFree Endpoint

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Language Generation
    45.6K290K
    1y
    Meta
    DownloadableFree Endpoint

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Language Generation
    26.7K1.22M
    1y
    Mistral AI
    DownloadableFree Endpoint

    mixtral-8x7b-instruct-v0.1

    An MOE LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    996K
    10mo
    Meta
    DownloadableFree Endpoint

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    3.9M
    1y
    Meta
    DownloadableFree Endpoint

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    25.09M
    11mo
    NVIDIA
    DownloadableFree Endpoint

    llama-3.3-nemotron-super-49b-v1

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    advanced reasoning
    4.93M
    10mo
    NVIDIA
    DownloadableFree Endpoint

    llama-3.3-nemotron-super-49b-v1.5

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    advanced reasoning
    3.17M
    10mo
    Mistral AI
    DownloadableFree Endpoint

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    Model
    language generation
    3.62M
    6mo
    DGX Spark
    15 MIN

    Open WebUI with Ollama

    Install Open WebUI and use Ollama to chat with models on your Spark
    Playbook
    DGX
    8mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    Model
    tool calling
    10.33M
    3mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    Model
    MoE
    13.15M
    3mo