NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

68 results for

Filters

  • Free Endpoint
    36
  • Partner Endpoint
    28
  • Download Available
    27
  • Enterprise
    1
  • Launchable
    1
  • Code Generation
    23
  • Text-to-Speech
    3
  • Image Generation
    2
  • Text Translation
    2
  • Text-to-Image
    2
  • Together AI
    21
  • Deep Infra
    20
  • GMI Cloud
    12
  • CoreWeave
    8
  • Bitdeer AI
    6
  • NVIDIA
    13
  • Microsoft
    9
  • Google
    7
  • Meta
    7
  • Qwen
    6
  • NVIDIA AI
    1
  • NVIDIA Isaac GR00T
    1
  • NVIDIA Omniverse
    1
  • DGX Spark
    30 MIN

    Text to Knowledge Graph

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    6mo
    DGX Station
    30 MIN

    Text to Knowledge Graph on DGX Station

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    1mo
    IBM
    DeprecatedFree Endpoint

    granite-guardian-3.0-8b

    Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior
    Model
    Guardrail
    44.67K
    1y
    Google
    DeprecatedFree Endpoint

    shieldgemma-9b

    Guardrail model to ensure that responses from LLMs are appropriate and safe
    Model
    Guardrail
    67.72K
    1y
    MediaTek
    DeprecatedFree Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    Chat
    56.15K
    10mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    Code Generation
    252K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    265K
    10mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    Chat
    394K
    11mo
    AI21 Labs
    DeprecatedFree Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    Model
    Chat
    85.13K
    10mo
    NVIDIA
    DeprecatedFree Endpoint

    llama-3.1-nemotron-70b-reward

    Leaderboard topping reward model supporting RLHF for better alignment with human preferences.
    Model
    Text-to-text
    247K
    1y
    NVIDIA
    DeprecatedFree Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    387K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    139K
    1y
    Qwen
    Downloadable

    qwen3-next-80b-a3b-thinking

    80B parameter AI model with hybrid reasoning, MoE architecture, support for 119 languages.
    Model
    Reasoning
    1.86M
    7mo
    Minimaxai
    Free Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    coding
    567K
    4d
    Baichuan AI
    DeprecatedFree Endpoint

    baichuan2-13b-chat

    Support Chinese and English chat, coding, math, instruction following, solving quizzes
    Model
    Chinese Language Generation
    149K
    10mo
    THUDM
    DeprecatedFree Endpoint

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    Text Translation
    65.61K
    9mo
    DeepSeek AI
    DeprecatedFree Endpoint

    deepseek-v3.1

    DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.
    Model
    Reasoning
    10.99M
    7mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    long context
    14.89M
    4mo
    Utter-project
    DeprecatedDownloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    Sovereign AI
    4.39K92.63K
    10mo
    Google
    DeprecatedFree Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    Chat
    645K
    10mo
    Gotocompany
    DeprecatedDownloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
    Model
    Sovereign AI
    73.63K
    10mo
    Google
    DeprecatedDownloadable

    gemma-2-9b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    1.35M
    10mo
    Google
    DeprecatedDownloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Model
    Translation
    4.03K288K
    10mo
    Google
    Downloadable

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    coding
    1.47M
    1w
    Items per page
    of 3 pages