stabilityai/stable-diffusion-xl
Generate images and stunning visuals with realistic aesthetics.
Model Overview
Description:
SDXL is a latent diffusion model for text-to-image synthesis. Compared to previous versions of Stable Diffusion, SDXL uses a UNet backbone that is three times larger. The increase in model parameters comes mainly from additional attention blocks and a larger cross-attention context, since SDXL uses a second text encoder.
Model Card
Stable Diffusion XL 1.0 TensorRT Model Card
Stable Diffusion XL 1.0 Base Model Card
Stable Diffusion XL 1.0 Refiner Model Card
Terms of use
By accessing this model, you agree to the SDXL 1.0 license terms and conditions, the acceptable use policy, and the Stability AI privacy policy.
Third-Party Community Consideration:
This model is not owned or developed by NVIDIA. This model has been developed and built to a third party's requirements for this application and use case; see Stability AI's SDXL Model Card.
Reference(s):
Model Architecture:
Architecture Type: Transformer and Convolutional Neural Network (CNN)
Network Architecture: UNet + attention blocks
Model Version: SDXL 1.0
Input:
Input Format: Text
Input Parameters: scheduler type, denoising steps, classifier-free guidance
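To illustrate how the three input parameters fit together, here is a minimal sketch in Python. The GenerationParams class, its field names, and its default values are hypothetical and are not taken from this model card or from any SDXL API. The apply_cfg function shows the standard classifier-free guidance combination, in which the unconditional noise prediction is pushed toward the text-conditioned one by the guidance scale. It works on plain lists for clarity, whereas a real pipeline would operate on latent tensors at every denoising step.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GenerationParams:
    """Hypothetical container for the inputs listed above (names are illustrative)."""
    scheduler: str = "euler"       # assumed scheduler name, not from the model card
    steps: int = 30                # number of denoising steps (assumed default)
    guidance_scale: float = 7.0    # classifier-free guidance weight (assumed default)

    def validate(self) -> None:
        # Basic sanity checks; the real service may enforce different limits.
        if self.steps < 1:
            raise ValueError("steps must be at least 1")
        if self.guidance_scale < 0:
            raise ValueError("guidance_scale must be non-negative")


def apply_cfg(uncond: List[float], cond: List[float], scale: float) -> List[float]:
    """Combine unconditional and text-conditioned noise predictions.

    Standard classifier-free guidance: uncond + scale * (cond - uncond).
    """
    if len(uncond) != len(cond):
        raise ValueError("predictions must have the same length")
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]
```

With a scale of 0 the result equals the unconditional prediction, and with a scale of 1 it equals the conditioned prediction. Larger values extrapolate past the conditioned prediction, which generally strengthens prompt adherence at some cost to diversity.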
Output:
Output Format: Red, Green, Blue (RGB) Image
Output Parameters: 2D
Software Integration:
Supported Hardware Platform(s): Hopper, Ampere/Turing
Supported Operating System(s): Linux
Inference:
Engine: Triton
Test Hardware: H100
Ethical Considerations
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. Please report security vulnerabilities or NVIDIA AI Concerns here.