NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

61 results for

Filters

  • Free Endpoint
    39
  • Partner Endpoint
    22
  • Download Available
    22
  • Launchable
    1
  • Code Generation
    23
  • Image-to-Text
    10
  • Text Translation
    2
  • Deep Infra
    14
  • Together AI
    11
  • Bitdeer AI
    5
  • CoreWeave
    3
  • GMI Cloud
    3
  • Microsoft
    10
  • Google
    9
  • Meta
    8
  • Mistral AI
    8
  • NVIDIA
    5
  • NVIDIA AI
    1
  • Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    262K
    11mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    224K
    10mo
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    45.57K
    10mo
    Mistral AI
    Free Endpoint

    mistral-medium-3-instruct

    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    Model
    chat
    1.33M
    9mo
    NVIDIA
    Free Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    318K
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    206K
    8mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    81.75K
    10mo
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    Model
    chat
    1.35M
    4mo
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    Model
    chat
    5.16M
    4mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    chat
    2.09M
    10mo
    Mistral AI
    Free Endpoint

    mistral-small-3.1-24b-instruct-2503

    Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
    Model
    chat
    1.6M
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    chat
    89.99K
    1y
    Microsoft
    Deprecation in 4dFree Endpoint

    phi-3.5-vision-instruct

    Cutting-edge open multimodal model exceling in high-quality reasoning from images.
    Model
    Vision Assistant
    1.09M
    1y
    Meta
    Downloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    750K
    10mo
    Rakuten
    Free Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    39.2K
    10mo
    Rakuten
    Free Endpoint

    rakutenai-7b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    38.56K
    10mo
    NVIDIA
    Free Endpoint

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
    Model
    language generation
    3.54K
    1y
    Meta
    Downloadable

    llama3-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    chat
    810K
    10mo
    TokyoTech-LLM
    Downloadable

    llama-3-swallow-70b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    52.5K
    10mo
    Institute of Science Tokyo
    Downloadable

    llama-3.1-swallow-70b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    51.01K
    10mo
    Institute of Science Tokyo
    Downloadable

    llama-3.1-swallow-8b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Model
    chat
    40.34K
    10mo
    Microsoft
    Free Endpoint

    phi-3-medium-128k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    86.78K
    10mo
    Microsoft
    Free Endpoint

    phi-3-medium-4k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    41.9K
    10mo
    Microsoft
    Free Endpoint

    phi-3-small-128k-instruct

    Long context cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    57.17K
    10mo
    Items per page
    of 3 pages