NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

44 results for

Filters (1)

  • Free Endpoint
    25
  • Partner Endpoint
    17
  • Download Available
    19
  • Launchable
    0
  • Enterprise
    0
  • Code Generation
    20
  • Image-to-Text
    3
  • Text Translation
    2
  • Object Detection
    0
  • Retrieval Augmented Generation
    0
  • Fireworks AI
    11
  • Deep Infra
    9
  • Together AI
    9
  • CoreWeave
    3
  • GMI Cloud
    2
  • Microsoft
    9
  • Google
    7
  • Meta
    6
  • NVIDIA
    3
  • Mistral AI
    3
  • NVIDIA AI
    0
  • language generation
  • Baichuan AI
    Free Endpoint

    baichuan2-13b-chat

    Support Chinese and English chat, coding, math, instruction following, solving quizzes
    Model
    Chinese Language Generation
    288K
    10mo
    Rakuten
    Free Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    272K
    10mo
    THUDM
    Free Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    chat
    285K
    8mo
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    273K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    404K
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    419K
    8mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    323K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    402K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    chat
    328K
    1y
    NVIDIA
    Free Endpoint

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
    Model
    language generation
    4.34K
    1y
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    3.97K270K
    9mo
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    714K
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
    Model
    chat
    273K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    2.82M
    10mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Model
    chat
    4.06K439K
    10mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    519K
    8mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    485K
    10mo
    TokyoTech-LLM
    Downloadable

    llama-3-swallow-70b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    281K
    10mo
    Yen-Ting Lin
    Downloadable

    llama-3-taiwan-70b-instruct

    Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
    Model
    regional language generation
    290K
    10mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    chat
    8.31M
    9mo
    Institute of Science Tokyo
    Downloadable

    llama-3.1-swallow-70b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    278K
    10mo
    Institute of Science Tokyo
    Downloadable

    llama-3.1-swallow-8b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    273K
    10mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    16.5K188K
    10mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    31.32K933K
    10mo
    Items per page
    of 2 pages