Wavespeed

Chroma

Chroma is an 8.9 billion parameter text-to-image model developed by WaveSpeed AI, built on the FLUX.1-schnell architecture. It was trained using over 105,000 hours of NVIDIA H100 GPU time, with a dataset curated from 5 million selected images. The model is designed around a philosophy of unrestricted creative expression, removing the content filters found on many mainstream image generation platforms. It supports image output up to 1536×1536 pixels and is noted for clean renders, natural lighting, strong color harmony, and anatomical accuracy in human figures, hands, and faces. Chroma is well-suited for commercial photography, digital illustration, character design, concept art, and medical or educational illustration where content restrictions would otherwise be a barrier. It handles complex, multi-element scenes involving people, props, and environments with strong prompt adherence. The model responds particularly well to structured prompts organized around subject, context, style, lighting, camera, and mood. It is available through WaveSpeed AI and is optimized for both single-shot and batch generation workflows.

Unknown 10,000 context N/A output

Text-to-Image Generation High-Resolution Output Seed-Based Reproducibility Anatomical Accuracy Unrestricted Content Generation Batch Workflow Support

Overview ↓ About ↓ Capabilities ↓ Pricing ↓ Parameters ↓ Tools ↓ Resources ↓ Community ↓ FAQ ↓

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Wavespeed

Input Context Window

The number of tokens supported by the input context window.

10,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A tokens

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

Unknown

Knowledge Cut-off Date

When the model's knowledge was last updated.

Unknown

API Providers

The providers that offer this model. This is not an exhaustive list.

Wavespeed

Modalities

Types of data this model can process.

Image Text

What is Chroma

A fuller summary of positioning, capabilities, and source-specific details for Chroma.

Chroma is an 8.9 billion parameter text-to-image model developed by WaveSpeed AI, built on the FLUX.1-schnell architecture. It was trained using over 105,000 hours of NVIDIA H100 GPU time, with a dataset curated from 5 million selected images. The model is designed around a philosophy of unrestricted creative expression, removing the content filters found on many mainstream image generation platforms. It supports image output up to 1536×1536 pixels and is noted for clean renders, natural lighting, strong color harmony, and anatomical accuracy in human figures, hands, and faces.

Chroma is well-suited for commercial photography, digital illustration, character design, concept art, and medical or educational illustration where content restrictions would otherwise be a barrier. It handles complex, multi-element scenes involving people, props, and environments with strong prompt adherence. The model responds particularly well to structured prompts organized around subject, context, style, lighting, camera, and mood. It is available through WaveSpeed AI and is optimized for both single-shot and batch generation workflows.

Capabilities

What Chroma supports

IMG

Text-to-Image Generation

Generates images from text prompts using the FLUX.1-schnell architecture with 8.9 billion parameters. Supports output resolutions up to 1536×1536 pixels.

High-Resolution Output

Produces images at resolutions up to 1536×1536 pixels, configurable via numeric width and height inputs. Suitable for commercial and print-quality use cases.

Seed-Based Reproducibility

Accepts a seed input to enable deterministic image generation, allowing the same prompt and seed combination to reproduce consistent results across runs.

Anatomical Accuracy

Trained with a curated dataset of 5 million images to improve rendering of human figures, hands, and faces with reduced distortion artifacts.

Unrestricted Content Generation

Operates without the content restrictions present on many mainstream platforms, enabling mature artistic, medical, and experimental creative work.

Batch Workflow Support

Optimized for consistent generation across both single-shot and batch workflows, making it practical for high-volume creative production pipelines.

Pricing for Chroma

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Input tokens N/A Per million tokens

Output tokens N/A Per million tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Wavespeed

Configuration & Parameters

The configurable options currently documented for this model.

Width

Number

Default: 1024 Range: 256 - 1536

Height

Number

Default: 1024 Range: 256 - 1536

Seed

A specific value that is used to guide the 'randomness' of the generation.

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Width Height Seed

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Announcement Blog Post Announcements

→

Model Page Playground

→

Official Documentation Documentation

→

AI tools related to Chroma

These tools are strongly connected to Chroma through direct product references, provider mentions, or explicit model mappings.

AI Image Generator

Holara

Holara is an AI-powered platform designed for generating anime-style artwork. Users can create unique images by entering prompts, selecting specific styles, and adjusting various settings. The platform includes features such as image generation, style selection, prompt assistance, and the ability to purchase hologems to increase creation capacity.

Free 28 visits 20 saves

Large Language Models (LLMs)

neoSVG

neoSVG is an AI-powered tool that generates scalable, high-quality vector graphics from text prompts. Its intuitive interface allows users to input descriptions and receive illustrations ready for use in web design, branding, UI/UX projects, and digital art.

Free 8 visits 6 saves

AI Image Generator

Chromatic Lens

Chromatic Lens is an AI-powered application built to enhance product photography. It enables users to produce high-quality visuals through precise editing and background generation, helping businesses create professional product imagery, differentiate their brand, increase sales, and build customer trust.

Free 0 visits 3 saves

Other

Colorific: Color Palette

Colorific helps you create vivid and harmonious color palettes. Generate stunning schemes with one click, including monochromatic, analogous, complementary, split-complementary, tetradic, and triadic options. Use the random generation button for inspiration or import your own colors to find matching complements. You can continuously adjust saved palettes, organize them into folders, and easily copy color codes or share them as images. Supported formats include Hex, RGB, HSV/HSB, HSL, LAB, and HCT, utilizing a perceptually accurate color system that reflects what you see.

Free 0 visits 3 saves

Community discussion

What people think about Chroma

Chroma discussions are most active in r/leagueoflegends, r/Warframe, r/marvelrivals. Top Reddit threads cluster around benchmark and model-comparison threads, safety and censorship questions.

The strongest match in this snapshot has 44516 upvotes and 217 comments.

r/StableDiffusion 31 upvotes 62 comments April 20, 2026

Chroma replacement?

I still use chroma for it's prompt adherence, totally uncensored, and use Klein to refine. I'm just wondering if there is something newer that is as or more uncensored as chroma?

I know it's asking a lot, but it'd be nice to see a model that can handle a prompt describing three or more characters

Open Reddit thread

r/Warframe 2,856 upvotes 231 comments April 3, 2026

[OC] Let's talk for a sec about Chroma

Big boi Chroma is with us since 2015, he's well known as employee of 'Double Cred.' Ind. and as Dad-Frame for babytennos.

After many years his Vex Armor & Effigy \[double credit reasons\] kept him useful till this day, but rest of his kit feels 'underwhelming' for him,

Therefore, for every Chromalover, we shall push the agenda forward, create art, create suggestion, spread awareness, do whatever your hearts desire and we shall rise on our wings!!

Feel free to share your love for Chroma :))

\#JoinTheChromalution
\#ReworkMyBoi

Open Reddit thread

r/StableDiffusion 33 comments March 21, 2026

How do you use Chroma?

I know that because I'm using the flash lora my results are always going to be bad but people constantly call chroma a hidden gen or their favorite model but it seems impossible to get anything that actually looks good. Using the same prompts you would use on Z-Image Turbo or Base gives results that look like a wax figure. Non-photorealistic outputs always look alright at best. At \~30it/s it's incredibly slow as well. Am I missing something? I know some people use it for porn, but I'm certain that even SDXL models would give better results if that's what you want.

Open Reddit thread

r/StableDiffusion 49 upvotes 122 comments October 26, 2025

What's the big deal about Chroma?

I am trying to understand why are people excited about Chroma. For photorealistic images I get improper faces, takes too long and quality is ok.

I use ComfyUI.

What is the use case of Chroma? Am I using it wrong?

Open Reddit thread

r/StableDiffusion 534 upvotes 198 comments June 4, 2025

This sub has SERIOUSLY slept on Chroma. Chroma is basically Flux Pony. It's not merely "uncensored but lacking knowledge." It's the thing many people have been waiting for

I've been active on this sub basically since SD 1.5, and whenever something new comes out that ranges from "doesn't totally suck" to "Amazing," it gets wall to wall threads blanketing the entire sub during what I've come to view as a new model "Honeymoon" phase.

All a model needs to get this kind of attention is to meet the following criteria:

1: new in a way that makes it unique

2: can be run on consumer gpus reasonably

3: at least a 6/10 in terms of how good it is.

So far, anything that meets these 3 gets plastered all over this sub.

The one exception is Chroma, a model I've sporadically seen mentioned on here but never gave much attention to until someone impressed upon me how great it is in discord.

And yeah. This is it. This is Pony Flux. It's what would happen if you could type NLP Flux prompts into Pony.

I am incredibly impressed. With popular community support, this could EASILY dethrone all the other image gen models even hidream.

I like hidream too. But you need a lora for basically EVERYTHING in that and I'm tired of having to train one for every naughty idea.

Hidream also generates the exact same shit every time no matter the seed with only tiny differences. And despite using 4 different text encoders, it can only reliably do 127 tokens of input before it loses coherence. Seriously though all that vram on text encoders so you can enter like 4 fucking sentences at the most before it starts forgetting. I have no idea what they were thinking there.

Hidream DOES have better quality than Chroma but with community support Chroma could EASILY be the best of the best

Open Reddit thread

View more discussions →

FAQ

Common questions about Chroma

What is the context window for Chroma?

Chroma has a context window of 10,000 tokens, which governs the length of text prompts it can process when generating images.

What architecture is Chroma based on?

Chroma is built on the FLUX.1-schnell architecture and has 8.9 billion parameters. It was trained using over 105,000 hours of NVIDIA H100 GPU time.

What image resolutions does Chroma support?

Chroma supports image output up to 1536×1536 pixels. Width and height are configurable via numeric inputs.

When was Chroma's training data collected?

According to the model metadata, Chroma's training date is listed as October 2025. Its dataset was curated from a pool of 5 million selected images.

Does Chroma have content restrictions?

Chroma is described as an uncensored model, meaning it does not apply the content filters common to many mainstream image generation platforms. It is intended for artists, designers, medical illustrators, and other professionals who require unrestricted creative output.

How can I get reproducible results with Chroma?

Chroma accepts a seed input. Using the same prompt, dimensions, and seed value will produce consistent, reproducible image outputs across generation runs.

More models from Wavespeed

Continue browsing adjacent models from the same provider.

← All AI Models