NVIDIA

Copyright © 2026 NVIDIA Corporation

41 results for

Filters (2)

  • Text-to-text
  • chat
    Baichuan AI
    Free Endpoint

    baichuan2-13b-chat

    Supports Chinese and English chat, coding, math, instruction following, and quiz solving.
    Model
    Chinese Language Generation
    210K
    10mo
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    188K
    10mo
    THUDM
    Free Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    chat
    195K
    8mo
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    3.71K 181K
    9mo
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    665K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    328K
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in the Indonesian language and its dialects.
    Model
    chat
    182K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    2.27M
    10mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications.
    Model
    chat
    3.69K 372K
    10mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    407K
    10mo
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.
    Model
    reasoning
    44.86M
    8mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    8.25M
    8mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    246K
    10mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    chat
    7.78M
    9mo
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    chat
    11.66M
    8mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    15.75K 129K
    10mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    30.08K 893K
    10mo
    Meta
    Downloadable

    llama3-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    chat
    655K
    10mo
    Meta
    Downloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    925K
    10mo
    NVIDIA
    Free Endpoint

    llama3-chatqa-1.5-8b

    Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
    Model
    chat
    187K
    10mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    chat
    766K
    9mo
    NVIDIA
    Free Endpoint

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.
    Model
    language generation
    4.15K
    1y
    Mistral AI
    Downloadable

    mixtral-8x22b-instruct-v0.1

    An MoE LLM that follows instructions, completes requests, and generates creative text.
    Model
    chat
    2.86M
    8mo
    Mistral AI
    Downloadable

    mixtral-8x7b-instruct-v0.1

    An MoE LLM that follows instructions, completes requests, and generates creative text.
    Model
    chat
    405K
    8mo