NVIDIA

Copyright © 2026 NVIDIA Corporation

53 results for

    NVIDIA
    Downloadable

    nemoguard-jailbreak-detect

    Industry-leading jailbreak classification model for protection against adversarial attempts
    Model
    nemo guardrails
    39.42K
    9mo
    Z.ai
    Free Endpoint

    glm-4.7

    GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
    Model
    Tool Calling
    15.08M
    2mo
    Z.ai
    Downloadable

    glm-5

    GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.
    Model
    MoE
    34.64M
    1mo
    NVIDIA
    Downloadable

    llama-3.1-nemoguard-8b-content-safety

    Leading content safety model for enhancing the safety and moderation capabilities of LLMs
    Model
    nemo guardrails
    330K
    1y
    NVIDIA
    Free Endpoint

    llama-3.1-nemotron-safety-guard-8b-v3

    Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
    Model
    content moderation
    378K
    5mo
    TokyoTech-LLM
    Downloadable

    llama-3-swallow-70b-instruct-v0.1

    Sovereign AI model trained on the Japanese language that understands regional nuances.
    Model
    chat
    281K
    10mo
    NVIDIA
    Downloadable

    llama-3.1-nemoguard-8b-topic-control

    Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
    Model
    nemo guardrails
    295K
    1y
    Meta
    Free Endpoint

    llama-guard-4-12b

    Multimodal model to classify safety for input prompts as well as output responses.
    Model
    LLM Multimodal Safety
    274K
    9mo
    NVIDIA
    Downloadable

    llama-3.2-nemoretriever-1b-vlm-embed-v1

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    Model
    nemo retriever
    205K
    9mo
    Meta
    Downloadable

    llama-3.3-70b-instruct

    Advanced LLM for reasoning, math, general knowledge, and function calling
    Model
    Instruction following
    19.03M
    9mo
    Mistral AI
    Free Endpoint

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    chat
    366K
    10mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    chat
    854K
    9mo
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general-purpose VLM ideal for chat and instruction-based use cases
    Model
    chat
    2.9M
    4mo
    MediaTek
    Free Endpoint

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    273K
    10mo
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    chat
    16.4M
    3mo
    Tiiuae
    Free Endpoint

    falcon3-7b-instruct

    Instruction-tuned LLM achieving SoTA performance on reasoning, math, and general knowledge
    Model
    chat
    1.83M
    10mo
    Gotocompany
    Downloadable

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in the Indonesian language and its dialects.
    Model
    chat
    273K
    9mo
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.
    Model
    reasoning
    46.34M
    7mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    8.71M
    7mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    323K
    10mo
    Meta
    Downloadable

    llama-3.1-405b-instruct

    Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.
    Model
    chat
    4.03M
    1y
    Meta
    Downloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    chat
    1.02M
    10mo
    NVIDIA
    Free Endpoint

    llama3-chatqa-1.5-8b

    Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
    Model
    chat
    272K
    10mo
    Mistral AI
    Downloadable

    mixtral-8x22b-instruct-v0.1

    An MoE LLM that follows instructions, completes requests, and generates creative text.
    Model
    chat
    3.38M
    8mo