NVIDIA
Copyright © 2026 NVIDIA Corporation

18 results for

Filters

  • Free Endpoint (11)
  • Partner Endpoint (5)
  • Download Available (6)
  • Launchable (1)
  • Code Generation (10)
  • Deep Infra (5)
  • Together AI (3)
  • Microsoft (7)
  • Mistral AI (2)
  • Qwen (2)
  • Rakuten (2)
  • NVIDIA (1)
  • NVIDIA AI (1)

    Stockmark
    Downloadable

    stockmark-2-100b-instruct

    Japanese-specialized large language model for enterprises to read and understand complex business documents.
    Model
    sovereign ai
    1.21M
    6mo
    Microsoft
    Deprecated, Free Endpoint

    phi-3-medium-128k-instruct

    Cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    Chat
    94.58K
    11mo
    Microsoft
    Deprecated, Free Endpoint

    phi-3-medium-4k-instruct

    Cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    Chat
    55.24K
    11mo
    Microsoft
    Deprecated, Free Endpoint

    phi-3-small-128k-instruct

    Long-context, cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    Chat
    60.31K
    11mo
    Microsoft
    Deprecated, Free Endpoint

    phi-3-small-8k-instruct

    Cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    Chat
    43.87K
    11mo
    Meta
    Deprecated, Downloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    Chat
    712K
    11mo
    Qwen
    Deprecated, Free Endpoint

    qwen2-7b-instruct

    Chinese and English LLM targeting language, coding, mathematics, and reasoning tasks.
    Model
    Chinese Language Generation
    135K
    11mo
    Qwen
    Deprecated, Downloadable

    qwen2.5-7b-instruct

    Chinese and English LLM targeting language, coding, mathematics, and reasoning tasks.
    Model
    Chinese Language Generation
    7.16M
    11mo
    Rakuten
    Deprecated, Free Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    Chat
    45.01K
    11mo
    Rakuten
    Deprecated, Free Endpoint

    rakutenai-7b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    Chat
    44.78K
    11mo
    Microsoft
    Deprecated, Free Endpoint

    phi-3-mini-128k-instruct

    Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
    Model
    Chat
    104K
    11mo
    Microsoft
    Deprecated, Downloadable

    phi-3-mini-4k-instruct

    Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
    Model
    Chat
    77.23K
    11mo
    Microsoft
    Deprecated, Free Endpoint

    phi-3.5-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments.
    Model
    Chat
    1.43M
    1y
    AI Singapore
    Deprecated, Free Endpoint

    sea-lion-7b-instruct

    LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
    Model
    Chat
    1y
    Upstage
    Free Endpoint

    solar-10.7b-instruct

    Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
    Model
    Non-Commercial Use Only
    155K
    1y
    Mistral AI
    Downloadable

    mixtral-8x22b-instruct-v0.1

    A mixture-of-experts (MoE) LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    2.38M
    9mo
    Mistral AI
    Downloadable

    mixtral-8x7b-instruct-v0.1

    A mixture-of-experts (MoE) LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    454K
    9mo
    NVIDIA
    Launchable

    AI Agent for Telecom Network Configuration Planning

    Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.
    Blueprint
    NVIDIA AI
    1mo