Advanced text-to-image model for generating high quality images
Request the model checkpoint from Stability AI
Stable Diffusion 3 Medium is Stability AI's text-to-image model that features greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency. Compared to previous versions of Stable Diffusion versions, Stable Diffusion 3 leverages a novel Multimodal Diffusion Transformer (MMDiT) architecture that combines a diffusion transformer architecture and flow matching. We have optimized the model's speed using NVIDIA's TensorRT (TRT), resulting in significant acceleration with no quality loss.
Developed by: Stability AI
Model type: Generative text-to-image model
By accessing this model, you are agreeing to the Stable Diffusion 3 terms and conditions of the license, acceptable use policy and stability.ai privacy policy
This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see Stability-AI's Stable Diffusion 3 Medium Model Card.
Architecture Type: Transformer and Convolutional Neural Network (CNN)
Network Architecture: Multimodal Diffusion Transformer
Model Version: Stable Diffusion 3 Medium
Input Format: Text
Input Parameters: steps, classifier-free guidance scale, output image aspect ratio, negative prompt
Output Format: Red, Green, Blue (RGB) JPEG Image
Output Size: 1024x1024, 768x1344, 1344x768, 1344x768, 1344x768, 1344x768, 1216x832
Output Parameters: 2D
Supported Hardware Platform(s): Hopper, Ampere/Turing
Supported Operating System(s): Linux
Engine: Triton
Test Hardware: Other
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. Please report security vulnerabilities or NVIDIA AI Concerns here.