NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

54 results for

Filters

  • Free Endpoint
    27
  • Partner Endpoint
    23
  • Download Available
    22
  • Enterprise
    1
  • Launchable
    1
  • Code Generation
    19
  • Text-to-Speech
    3
  • Image Generation
    2
  • Text-to-Image
    2
  • AI Agent
    1
  • Deep Infra
    17
  • Together AI
    17
  • GMI Cloud
    11
  • CoreWeave
    7
  • Bitdeer AI
    4
  • NVIDIA
    11
  • Microsoft
    9
  • Meta
    6
  • Qwen
    6
  • Mistral AI
    5
  • NVIDIA AI
    1
  • NVIDIA Isaac GR00T
    1
  • NVIDIA Omniverse
    1
  • DGX Spark
    30 MIN

    Text to Knowledge Graph

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    6mo
    DGX Station
    30 MIN

    Text to Knowledge Graph on DGX Station

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    1mo
    NVIDIA
    Launchable

    PDF to Podcast

    Transform PDFs into AI podcasts for engaging on-the-go audio content.
    Blueprint
    NVIDIA AI
    1mo
    Google
    DeprecatedFree Endpoint

    shieldgemma-9b

    Guardrail model to ensure that responses from LLMs are appropriate and safe
    Model
    Guardrail
    65.03K
    1y
    Mistral AI
    Free Endpoint

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    3.52M
    4mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    Code Generation
    312K
    11mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    451K
    11mo
    Google
    DeprecatedFree Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    Chat
    443K
    11mo
    NVIDIA
    DeprecatedFree Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    362K
    11mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    190K
    1y
    Qwen
    DeprecatedFree Endpoint

    qwen2.5-coder-7b-instruct

    Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
    Model
    code completion
    241K
    11mo
    Qwen
    Downloadable

    qwen3-next-80b-a3b-thinking

    80B parameter AI model with hybrid reasoning, MoE architecture, support for 119 languages.
    Model
    Reasoning
    1.9M
    7mo
    NVIDIA
    DeprecatedFree Endpoint

    usdsearch

    AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.
    Model
    Digital Twin
    827
    1y
    Minimaxai
    Free Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    coding
    1.57M
    6d
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    long context
    14.23M
    4mo
    DGX Spark
    1 HR

    FLUX.1 Dreambooth LoRA Fine-tuning

    Fine-tune FLUX.1-dev 12B model using Dreambooth LoRA for custom image generation
    Playbook
    Image Generation
    6mo
    Google
    Downloadable

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    coding
    2.08M
    2w
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    39.81M
    8mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    3.6M
    10mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    16.24K324K
    11mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    17.63K1.01M
    11mo
    Meta
    DeprecatedDownloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Model
    Chat
    712K
    11mo
    Mistral AI
    DeprecatedFree Endpoint

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    Text-to-Text
    126K
    11mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    Chat
    897K
    10mo
    Items per page
    of 3 pages