NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

68 results for

Filters

  • Free Endpoint
    36
  • Partner Endpoint
    28
  • Download Available
    27
  • Enterprise
    1
  • Launchable
    1
  • Code Generation
    23
  • Text-to-Speech
    3
  • Image Generation
    2
  • Text Translation
    2
  • Text-to-Image
    2
  • Together AI
    21
  • Deep Infra
    20
  • CoreWeave
    8
  • Bitdeer AI
    6
  • Lightning AI
    5
  • NVIDIA
    13
  • Microsoft
    9
  • Google
    7
  • Meta
    7
  • Qwen
    6
  • NVIDIA AI
    1
  • NVIDIA Isaac GR00T
    1
  • NVIDIA Omniverse
    1
  • DGX Spark
    30 MIN

    Text to Knowledge Graph

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    6mo
    DGX Station
    30 MIN

    Text to Knowledge Graph on DGX Station

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    1mo
    IBM
    Free Endpoint

    granite-guardian-3.0-8b

    Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior
    Model
    Guardrail
    41.79K
    1y
    Google
    Free Endpoint

    shieldgemma-9b

    Guardrail model to ensure that responses from LLMs are appropriate and safe
    Model
    Guardrail
    66.37K
    1y
    Minimaxai
    Free Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    coding
    192K
    2d
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    52.03K
    10mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    chat
    242K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    231K
    10mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    270K
    11mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    85.34K
    10mo
    NVIDIA
    Free Endpoint

    llama-3.1-nemotron-70b-reward

    Leaderboard topping reward model supporting RLHF for better alignment with human preferences.
    Model
    Text-to-text
    106K
    1y
    NVIDIA
    Free Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    354K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    chat
    127K
    1y
    Qwen
    Downloadable

    qwen3-next-80b-a3b-thinking

    80B parameter AI model with hybrid reasoning, MoE architecture, support for 119 languages.
    Model
    chat
    1.86M
    7mo
    Baichuan AI
    Free Endpoint

    baichuan2-13b-chat

    Support Chinese and English chat, coding, math, instruction following, solving quizzes
    Model
    Chinese Language Generation
    130K
    10mo
    THUDM
    Free Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    chat
    60.14K
    9mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.1

    DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.
    Model
    chat
    10.97M
    7mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    chat
    15.04M
    3mo
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    4.45K86.39K
    10mo
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    598K
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
    Model
    chat
    46.57K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    1.26M
    10mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Model
    chat
    4.05K259K
    10mo
    Google
    Downloadable

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    coding
    1.21M
    1w
    Items per page
    of 3 pages