Copyright © 2026 NVIDIA Corporation

94 results for

  • Baichuan AI
    Free Endpoint

    baichuan2-13b-chat

    Supports Chinese and English chat, coding, math, instruction following, and quiz solving.
    Model
    Chinese Language Generation
    590K
    10mo
    Rakuten
    Free Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    538K
    10mo
    THUDM
    Free Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    chat
    609K
    8mo
    NVIDIA
    Free Endpoint

    llama3-chatqa-1.5-8b

    Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
    Model
    chat
    564K
    10mo
    DGX Spark
    1 HR

    Build and Deploy a Multi-Agent Chatbot

    Deploy a multi-agent chatbot system and chat with agents on your Spark
    Playbook
    DGX
    5mo
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    Model
    English
    1.61K
    6d
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    581K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    566K
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    714K
    8mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    577K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for the Hindi language.
    Model
    Indic
    564K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    chat
    578K
    1y
    NVIDIA
    Free Endpoint

    usdcode

    State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
    Model
    Digital Twin
    332K
    8mo
    Meta
    Downloadable

    llama-3.1-405b-instruct

    Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.
    Model
    chat
    3.39M
    1y
    NVIDIA
    Free Endpoint

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.
    Model
    language generation
    4K
    1y
    NVIDIA
    Launchable, Enterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    NVIDIA AI
    1mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.1-terminus

    DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
    Model
    chat
    13.49M
    5mo
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    4.3K
    535K
    9mo
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    809K
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in the Indonesian language and its dialects.
    Model
    chat
    536K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    4.79M
    10mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications.
    Model
    chat
    4.45K
    551K
    10mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    784K
    8mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    574K
    10mo