NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

54 results for

Filters

  • Free Endpoint
    27
  • Partner Endpoint
    23
  • Download Available
    22
  • Enterprise
    1
  • Launchable
    1
  • Code Generation
    19
  • Text-to-Speech
    3
  • Image Generation
    2
  • Text-to-Image
    2
  • AI Agent
    1
  • Deep Infra
    17
  • Together AI
    17
  • GMI Cloud
    11
  • CoreWeave
    7
  • Bitdeer AI
    4
  • NVIDIA
    11
  • Microsoft
    9
  • Meta
    6
  • Qwen
    6
  • Mistral AI
    5
  • NVIDIA AI
    1
  • NVIDIA Isaac GR00T
    1
  • NVIDIA Omniverse
    1
  • DGX Spark
    30 MIN

    Text to Knowledge Graph

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    6mo
    DGX Station
    30 MIN

    Text to Knowledge Graph on DGX Station

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    1mo
    Mistral AI
    Free Endpoint

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    3.62M
    4mo
    Qwen
    DeprecatedFree Endpoint

    qwen2.5-coder-7b-instruct

    Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
    Model
    code completion
    250K
    10mo
    Qwen
    Downloadable

    qwen2.5-coder-32b-instruct

    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    Model
    code completion
    2.64M
    9mo
    Google
    DeprecatedFree Endpoint

    shieldgemma-9b

    Guardrail model to ensure that responses from LLMs are appropriate and safe
    Model
    Guardrail
    67.72K
    1y
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    Code Generation
    252K
    10mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    265K
    10mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    Chat
    394K
    11mo
    NVIDIA
    DeprecatedFree Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    387K
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    139K
    1y
    Qwen
    Downloadable

    qwen3-next-80b-a3b-thinking

    80B parameter AI model with hybrid reasoning, MoE architecture, support for 119 languages.
    Model
    Reasoning
    1.86M
    7mo
    NVIDIA
    DeprecatedFree Endpoint

    usdsearch

    AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.
    Model
    Digital Twin
    895
    1y
    Minimaxai
    Free Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    coding
    567K
    4d
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    long context
    14.89M
    4mo
    DGX Spark
    1 HR

    FLUX.1 Dreambooth LoRA Fine-tuning

    Fine-tune FLUX.1-dev 12B model using Dreambooth LoRA for custom image generation
    Playbook
    Image Generation
    6mo
    Google
    Downloadable

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    coding
    1.47M
    2w
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    38.53M
    8mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    3.75M
    10mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    15.76K236K
    11mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    21.34K893K
    10mo
    Meta
    DeprecatedDownloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    Chat
    784K
    10mo
    Mistral AI
    DeprecatedFree Endpoint

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    Text-to-Text
    157K
    10mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    Chat
    721K
    10mo
    Items per page
    of 3 pages