NVIDIA

Copyright © 2026 NVIDIA Corporation

113 results for

Filters (1)

  • Free Endpoint
    58
  • Partner Endpoint
    64
  • Download Available
    56
  • Launchable
    0
  • Enterprise
    0
  • Code Generation
    27
  • Image-to-Text
    11
  • Text Translation
    2
  • Synthetic Data Generation
    1
  • Digital Twin
    1
  • Fireworks AI
    46
  • Deep Infra
    41
  • Together AI
    37
  • GMI Cloud
    22
  • Bitdeer AI
    17
  • NVIDIA
    16
  • Mistral AI
    14
  • Meta
    12
  • Microsoft
    10
  • Qwen
    10
  • NVIDIA AI
    0
  • NVIDIA Omniverse
    0
  • NVIDIA BioNemo
    0
  • NVIDIA Isaac GR00T
    0
  • Chat
  • Baichuan AI
    Free Endpoint

    baichuan2-13b-chat

    Supports Chinese and English chat, coding, math, instruction following, and quiz solving.
    Model
    Chinese Language Generation
    210K
    10mo
    Speakleash
    Free Endpoint

    bielik-11b-v2.6-instruct

    State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
    Model
    chat
    214K
    6mo
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    188K
    10mo
    THUDM
    Free Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    chat
    195K
    8mo
    Igenius
    Free Endpoint

    colosseum_355b_instruct_16k

    Multilingual LLM trained on NVIDIA DGX Cloud, designed for mission-critical use cases in regulated industries, including financial services, government, and heavy industry.
    Model
    chat
    10mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-llama-8b

    Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    Distillation
    2.31M
    8mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-qwen-14b

    Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    1.88K 2.17M
    10mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-qwen-32b

    Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.49K 2.54M
    10mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-qwen-7b

    Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.31K 2.18M
    10mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.1

    DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.
    Model
    chat
    11.34M
    7mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.1-terminus

    DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
    Model
    chat
    13.2M
    5mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    chat
    15.74M
    3mo
    Mistral AI
    Free Endpoint

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    4.65M
    3mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    chat
    341K
    10mo
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    3.71K 181K
    9mo
    Tiiuae
    Free Endpoint

    falcon3-7b-instruct

    Instruction-tuned LLM achieving SoTA performance on reasoning, math, and general knowledge.
    Model
    chat
    1.74M
    10mo
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    665K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    328K
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in the Indonesian language and its dialects.
    Model
    chat
    182K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    2.27M
    10mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications.
    Model
    chat
    3.69K 372K
    10mo
    Google
    Free Endpoint

    gemma-3-27b-it

    Cutting-edge open multimodal model excelling in high-quality reasoning from images.
    Model
    chat
    6.34M
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    315K
    8mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    414K
    8mo