NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

114 results for

Filters (1)

  • Free Endpoint
    60
  • Partner Endpoint
    63
  • Download Available
    54
  • Launchable
    1
  • Enterprise
    1
  • Code Generation
    27
  • Image-to-Text
    11
  • Text Translation
    2
  • Synthetic Data Generation
    1
  • Digital Twin
    1
  • Fireworks AI
    47
  • Deep Infra
    39
  • Together AI
    38
  • GMI Cloud
    23
  • Bitdeer AI
    18
  • NVIDIA
    17
  • Mistral AI
    14
  • Meta
    12
  • Microsoft
    10
  • Qwen
    10
  • NVIDIA AI
    1
  • NVIDIA Omniverse
    0
  • NVIDIA BioNemo
    0
  • NVIDIA Isaac GR00T
    0
  • chat
  • Baichuan AI
    Free Endpoint

    baichuan2-13b-chat

    Support Chinese and English chat, coding, math, instruction following, solving quizzes
    Model
    Chinese Language Generation
    590K
    10mo
    Speakleash
    Free Endpoint

    bielik-11b-v2.6-instruct

    State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
    Model
    chat
    586K
    6mo
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    581K
    10mo
    NVIDIA
    LaunchableEnterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    NVIDIA AI
    1mo
    THUDM
    Free Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    chat
    609K
    8mo
    Igenius
    Free Endpoint

    colosseum_355b_instruct_16k

    NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry
    Model
    chat
    80.81K
    10mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-llama-8b

    Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    Distillation
    5.12M
    8mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-qwen-14b

    Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    1.87K4.79M
    10mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-qwen-32b

    Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.22K5.33M
    10mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-qwen-7b

    Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.06K5.04M
    10mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.1

    DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.
    Model
    chat
    12.86M
    6mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.1-terminus

    DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
    Model
    chat
    13.49M
    5mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    chat
    15.68M
    3mo
    Mistral AI
    Free Endpoint

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    6.25M
    3mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    chat
    633K
    10mo
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    4.3K535K
    9mo
    Tiiuae
    Free Endpoint

    falcon3-7b-instruct

    Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
    Model
    chat
    2.02M
    10mo
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    809K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    566K
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
    Model
    chat
    536K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    4.79M
    10mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Model
    chat
    4.45K551K
    10mo
    Google
    Free Endpoint

    gemma-3-27b-it

    Cutting-edge open multimodal model exceling in high-quality reasoning from images.
    Model
    chat
    5.56M
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    chat
    714K
    8mo
    Items per page
    of 5 pages