NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

94 results for

Filters

  • Free Endpoint
    37
  • Partner Endpoint
    32
  • Download Available
    48
  • Launchable
    3
  • Enterprise
    2
  • Code Generation
    24
  • Object Detection
    7
  • Retrieval Augmented Generation
    4
  • Drug Discovery
    3
  • Image-to-Text
    3
  • Deep Infra
    21
  • Fireworks AI
    20
  • Together AI
    18
  • GMI Cloud
    7
  • CoreWeave
    6
  • NVIDIA
    34
  • Microsoft
    10
  • Google
    8
  • Meta
    7
  • Mistral AI
    5
  • NVIDIA AI
    4
  • Baichuan AI
    Free Endpoint

    baichuan2-13b-chat

    Support Chinese and English chat, coding, math, instruction following, solving quizzes
    Model
    Chinese Language Generation
    288K
    10mo
    Rakuten
    Free Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    272K
    10mo
    THUDM
    Free Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    chat
    285K
    8mo
    NVIDIA
    Free Endpoint

    llama3-chatqa-1.5-8b

    Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
    Model
    chat
    272K
    10mo
    DGX Spark
    1 HR

    Build and Deploy a Multi-Agent Chatbot

    Deploy a multi-agent chatbot system and chat with agents on your Spark
    Playbook
    DGX
    5mo
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    Model
    English
    4.85K
    2w
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    273K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    404K
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    419K
    8mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    323K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    402K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    chat
    328K
    1y
    NVIDIA
    Free Endpoint

    usdcode

    State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
    Model
    Digital Twin
    129K
    8mo
    Meta
    Downloadable

    llama-3.1-405b-instruct

    Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.
    Model
    chat
    4.03M
    1y
    NVIDIA
    Free Endpoint

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
    Model
    language generation
    4.34K
    1y
    NVIDIA
    LaunchableEnterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    NVIDIA AI
    1mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.1-terminus

    DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
    Model
    chat
    13.89M
    5mo
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    3.97K270K
    9mo
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    714K
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
    Model
    chat
    273K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    2.82M
    10mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Model
    chat
    4.06K439K
    10mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    519K
    8mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    485K
    10mo
    Items per page
    of 4 pages