Stability

Stable Diffusion 3

Stable Diffusion 3 (SD3) is a text-to-image generation model developed by Stability AI and released in June 2024. It introduces a Multimodal Diffusion Transformer (MMDiT) architecture that maintains separate weight sets for image and language representations, which improves the model's ability to interpret complex, detailed prompts. The model is available in multiple size variants ranging from 800 million to 8 billion parameters, making it deployable across a range of hardware configurations. One of SD3's most notable characteristics is its ability to render legible text within generated images, a task that has historically been difficult for diffusion-based models. The 8B parameter variant fits within 24GB of VRAM and generates a 1024×1024 image in approximately 34 seconds using 50 sampling steps. SD3 is well suited for creative professionals, developers, and researchers who require high-fidelity image generation with strong alignment to nuanced text prompts.

Unknown 10,000 context N/A output

Text-to-Image Generation Typography Rendering Prompt Adherence Seed Control Style and Format Selection Scalable Model Sizes

Overview ↓ About ↓ Capabilities ↓ Pricing ↓ Parameters ↓ Tools ↓ Resources ↓ Community ↓ FAQ ↓

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Stability

Input Context Window

The number of tokens supported by the input context window.

10,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A tokens

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

Unknown

Knowledge Cut-off Date

When the model's knowledge was last updated.

Unknown

API Providers

The providers that offer this model. This is not an exhaustive list.

Stability

Modalities

Types of data this model can process.

Image Text

What is Stable Diffusion 3

A fuller summary of positioning, capabilities, and source-specific details for Stable Diffusion 3.

Stable Diffusion 3 (SD3) is a text-to-image generation model developed by Stability AI and released in June 2024. It introduces a Multimodal Diffusion Transformer (MMDiT) architecture that maintains separate weight sets for image and language representations, which improves the model's ability to interpret complex, detailed prompts. The model is available in multiple size variants ranging from 800 million to 8 billion parameters, making it deployable across a range of hardware configurations.

One of SD3's most notable characteristics is its ability to render legible text within generated images, a task that has historically been difficult for diffusion-based models. The 8B parameter variant fits within 24GB of VRAM and generates a 1024×1024 image in approximately 34 seconds using 50 sampling steps. SD3 is well suited for creative professionals, developers, and researchers who require high-fidelity image generation with strong alignment to nuanced text prompts.

Capabilities

What Stable Diffusion 3 supports

IMG

Text-to-Image Generation

Generates images from natural language text prompts using a Multimodal Diffusion Transformer architecture. Supports output at resolutions including 1024×1024 pixels.

Typography Rendering

Renders legible, accurate text within generated images, a capability that diffusion models have historically struggled with. Achieved through the MMDiT architecture's improved language understanding.

Prompt Adherence

Follows detailed and nuanced text prompts closely, including multi-subject scenes and complex compositional instructions. Separate image and language weight sets in MMDiT contribute to this behavior.

Seed Control

Accepts a user-defined seed value to produce reproducible image outputs. Useful for iterating on a composition while holding other variables constant.

Style and Format Selection

Exposes configurable select inputs for controlling generation parameters such as aspect ratio, style, and output format. Multiple select fields are available in the input schema.

Scalable Model Sizes

Available in variants from 800M to 8B parameters to accommodate different hardware constraints. The largest variant requires approximately 24GB of VRAM.

Pricing for Stable Diffusion 3

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Input tokens N/A Per million tokens

Output tokens N/A Per million tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Stability

Configuration & Parameters

The configurable options currently documented for this model.

SD3 Model

Select

Default: sd3-large

sd3-medium sd3-large sd3-large-turbo

Negative Prompt

Text

A blurb of text describing what you do not wish to see in the output image.

Aspect Ratio

Select

Default: 1:1

1:1 2:3 3:2 4:5 5:4 9:16 9:21 16:9 21:9

Seed

A specific value that is used to guide the 'randomness' of the generation. Omit this parameter or pass 0 to use a random seed.

Output Format

Select

Default: png

jpeg png

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

SD3 Model Negative Prompt Aspect Ratio Seed Output Format

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Announcement Blog Post Announcements

→

Research Paper (arXiv) Research

→

Platform API Documentation

→

Stable Image Product Page Other

→

Stable Diffusion 3 on Hugging Face Open Source

→

Stability AI Developer Documentation Documentation

→

AI tools related to Stable Diffusion 3

These tools are strongly connected to Stable Diffusion 3 through direct product references, provider mentions, or explicit model mappings.

AI Image Generator

stablediffusion3.net

stablediffusion3.net provides access to Stable Diffusion 3, the advanced text-to-image model from Stability AI. The platform offers resources regarding the Stable Diffusion 3 release date, download options, API access, and free online usage. Users can test Stable Diffusion 3 to explore its features, such as enhanced image fidelity, multi-subject composition, and accurate text rendering.

Free 55 visits 1 saves

AI Image Generator

Stable Diffusion 3 Medium Demo Online

Stable Diffusion 3 Medium (SD3 Medium) is Stability AI's advanced two-billion-parameter text-to-image model. This online demo allows users to generate high-quality, photorealistic images from text prompts for free. The model utilizes Diffusion Transformer architecture to handle complex spatial relationships, compositional elements, and styles, while effectively rendering clear text and improving the accuracy of hands and faces without requiring complex workflows.

Free 0 visits 1 saves

AI Image Generator

Free AI Art Generator

Free AI Art Generator uses advanced algorithms to transform text prompts into AI-generated images. This platform provides a suite of free tools, including options powered by Midjourney, Stable Diffusion 3, DALL·E 3, and Bing AI. Users can generate AI photos and pictures, convert images into videos or anime, and utilize editing features such as face swapping, clothing changes, and background removal, with options to share creations directly to social media.

Free 0 visits 1 saves

AI Image Generator

Picogen

Picogen is an AI image generation API designed for seamless integration into your products. It provides access to Midjourney, DALL-E 2, and Stable Diffusion through a single API, allowing for quick setup in under 5 minutes. Key features include text-to-image generation, image blending, background removal, and upscaling up to 8K resolution.

Free 0 visits 1 saves

Community discussion

What people think about Stable Diffusion 3

Stable Diffusion 3 discussions are most active in r/StableDiffusion, r/singularity, r/LocalLLaMA. Top Reddit threads cluster around benchmark and model-comparison threads, safety and censorship questions.

The strongest match in this snapshot has 2601 upvotes and 280 comments.

r/StableDiffusion 2,601 upvotes 280 comments February 27, 2024

Stable Diffusion 3 will have an open release. Same with video, language, code, 3D, audio etc. Just said by Emad @StabilityAI

Open Reddit thread

r/StableDiffusion 1,556 upvotes 446 comments February 22, 2024

Stable Diffusion 3 the Open Source DALLE 3 or maybe even better....

Open Reddit thread

r/StableDiffusion 1,030 upvotes 803 comments February 22, 2024

Stable Diffusion 3 - Stability AI

Open Reddit thread

r/StableDiffusion 2,020 upvotes 222 comments March 2, 2024

Stable Diffusion XL (SDXL) can now generate transparent images. This is revolutionary. Not Midjourney, not Dall E3, Not even Stable Diffusion 3 can do it.

Open Reddit thread

r/StableDiffusion 983 upvotes 469 comments June 17, 2024

Stable diffusion 3 banned from Civit...

[https://civitai.com/articles/5732](https://civitai.com/articles/5732)

Open Reddit thread

View more discussions →

FAQ

Common questions about Stable Diffusion 3

What is the context window for Stable Diffusion 3?

Stable Diffusion 3 has a context window of 10,000 tokens as listed in its metadata, which governs how much text prompt information the model can process at once.

When was Stable Diffusion 3 trained?

According to the model metadata, Stable Diffusion 3 has a training date of June 2024.

What architecture does Stable Diffusion 3 use?

SD3 uses a Multimodal Diffusion Transformer (MMDiT) architecture, which uses separate sets of weights for image and language representations. This differs from earlier Stable Diffusion versions that used a UNet-based architecture.

What hardware is required to run the largest Stable Diffusion 3 model?

The 8B parameter variant of Stable Diffusion 3 fits within 24GB of VRAM, such as that found on an NVIDIA RTX 4090, and generates a 1024×1024 image in approximately 34 seconds using 50 sampling steps.

What input types does Stable Diffusion 3 accept on MindStudio?

The model accepts text input for the prompt, along with multiple select fields for configuration options such as style or format, and a seed input for reproducible generation.

Who publishes Stable Diffusion 3?

Stable Diffusion 3 is published by Stability AI. It can be accessed via the Stability AI platform API or used directly through MindStudio without requiring separate API key management.

More models from Stability

Continue browsing adjacent models from the same provider.

← All AI Models