NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

60 results for

Filters

  • Free Endpoint
    39
  • Partner Endpoint
    26
  • Download Available
    22
  • Launchable
    0
  • Code Generation
    22
  • Image-to-Text
    10
  • Text Translation
    2
  • Fireworks AI
    18
  • Deep Infra
    14
  • Together AI
    11
  • Bitdeer AI
    5
  • CoreWeave
    4
  • Microsoft
    10
  • Google
    9
  • Meta
    8
  • Mistral AI
    8
  • NVIDIA
    4
  • NVIDIA AI
    0
  • Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    564K
    10mo
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    580K
    10mo
    Mistral AI
    Free Endpoint

    mistral-medium-3-instruct

    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    Model
    chat
    5.43M
    8mo
    NVIDIA
    Free Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    563K
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    698K
    8mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    573K
    10mo
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    Model
    chat
    4.79M
    3mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    chat
    800K
    9mo
    Mistral AI
    Free Endpoint

    mistral-small-3.1-24b-instruct-2503

    Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
    Model
    chat
    2.2M
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    chat
    575K
    1y
    Meta
    Downloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    1.13M
    10mo
    Rakuten
    Free Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    536K
    10mo
    Rakuten
    Free Endpoint

    rakutenai-7b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    540K
    10mo
    NVIDIA
    Free Endpoint

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
    Model
    language generation
    3.84K
    1y
    Meta
    Downloadable

    llama3-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    chat
    877K
    10mo
    TokyoTech-LLM
    Downloadable

    llama-3-swallow-70b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    539K
    10mo
    Institute of Science Tokyo
    Downloadable

    llama-3.1-swallow-70b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    530K
    10mo
    Institute of Science Tokyo
    Downloadable

    llama-3.1-swallow-8b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    537K
    10mo
    Microsoft
    Free Endpoint

    phi-3-medium-128k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    550K
    10mo
    Microsoft
    Free Endpoint

    phi-3-medium-4k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    548K
    10mo
    Microsoft
    Free Endpoint

    phi-3-small-128k-instruct

    Long context cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    583K
    10mo
    Microsoft
    Free Endpoint

    phi-3-small-8k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    541K
    10mo
    Qwen
    Free Endpoint

    qwen2-7b-instruct

    Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
    Model
    Chinese Language Generation
    739K
    10mo
    Yen-Ting Lin
    Downloadable

    llama-3-taiwan-70b-instruct

    Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
    Model
    regional language generation
    543K
    10mo
    Items per page
    of 3 pages