OpenAI

GPT Image 1

GPT Image 1 is OpenAI's flagship image generation model, released in April 2025, designed to convert text descriptions into images and make targeted edits to existing photos. It is built on a unified neural network architecture that processes both text and images together, which allows it to interpret complex, multi-part prompts and produce outputs that closely match the specified intent. The model supports readable text rendering within images, making it practical for use cases like marketing materials, infographics, and product labels. Output formats include square (1024×1024), portrait (1024×1536), and landscape (1536×1024) resolutions, with three quality tiers available. GPT Image 1 is particularly suited for creative professionals, marketers, and developers who need consistent, production-ready visuals. Its region-aware editing capability allows changes to specific parts of an image — such as a background or a single object — without altering unrelated elements like faces, lighting, or logos. The model accepts image inputs alongside text prompts, enabling workflows that involve editing or building upon existing photos. It is accessible via the OpenAI API and is integrated into MindStudio for use without requiring direct API key management.

Unknown 4,000 context N/A output
Text-to-Image Generation Region-Aware Editing Text Rendering in Images Image Input Support Multiple Output Formats Quality Tier Selection

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

OpenAI

Input Context Window

The number of tokens supported by the input context window.

4,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A tokens

Open Source

Whether the model's code is available for public use.

No

Release Date

When the model was first released.

Unknown

Knowledge Cut-off Date

When the model's knowledge was last updated.

Unknown

API Providers

The providers that offer this model. This is not an exhaustive list.

OpenAI API

Modalities

Types of data this model can process.

Image Text

What is GPT Image 1

A fuller summary of positioning, capabilities, and source-specific details for GPT Image 1.

GPT Image 1 is OpenAI's flagship image generation model, released in April 2025, designed to convert text descriptions into images and make targeted edits to existing photos. It is built on a unified neural network architecture that processes both text and images together, which allows it to interpret complex, multi-part prompts and produce outputs that closely match the specified intent. The model supports readable text rendering within images, making it practical for use cases like marketing materials, infographics, and product labels. Output formats include square (1024×1024), portrait (1024×1536), and landscape (1536×1024) resolutions, with three quality tiers available.

GPT Image 1 is particularly suited for creative professionals, marketers, and developers who need consistent, production-ready visuals. Its region-aware editing capability allows changes to specific parts of an image — such as a background or a single object — without altering unrelated elements like faces, lighting, or logos. The model accepts image inputs alongside text prompts, enabling workflows that involve editing or building upon existing photos. It is accessible via the OpenAI API and is integrated into MindStudio for use without requiring direct API key management.

Capabilities

What GPT Image 1 supports

IMG

Text-to-Image Generation

Generates images from text prompts using a unified neural network that processes text and image data together, supporting complex multi-part instructions.

AI

Region-Aware Editing

Edits specific regions of an existing image based on instructions while preserving unspecified elements such as faces, lighting, and logos.

IMG

Text Rendering in Images

Renders legible, accurate text inside generated images, enabling practical use for infographics, product labels, and presentation slides.

IMG

Image Input Support

Accepts arrays of image URLs as input alongside text prompts, enabling editing and transformation workflows on existing photos.

AI

Multiple Output Formats

Supports three output aspect ratios — square (1024×1024), portrait (1024×1536), and landscape (1536×1024) — selectable per request.

AI

Quality Tier Selection

Offers low, medium, and high quality settings so users can balance generation speed against output detail depending on their workflow needs.

Pricing for GPT Image 1

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

maxTemperature 1

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

OpenAI API

Configuration & Parameters

The configurable options currently documented for this model.

Size

Select
Default: 1024x1024
1024x1024 1536x1024 1024x1536

Background

Select
Default: auto
Auto Opaque Transparent

Source Images

Image URL Array

If you want to edit an existing image, provide the URL(s) or variables

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Size Background Source Images

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Community discussion

What people think about GPT Image 1

GPT Image 1 discussions are most active in r/ChatGPT, r/singularity, r/OpenAI. Top Reddit threads cluster around benchmark and model-comparison threads.

The strongest match in this snapshot has 1157 upvotes and 240 comments.

The image generation war just heated up again. OpenAI has officially dropped **GPT-Image-1.5** and it has already dethroned Google on the leaderboards.

**The Benchmarks (LMArena):**

**Rank:** #1 Overall in Text-to-Image With **Score** 1277 (Beating Gemini 3 Pro Image / Nano Banana Pro at 1235).

**Key Upgrades:**

**Speed:** 4x Faster than the previous model (DALL-E 3 / GPT-Image-1).

**Editing:** It supports precise "add, subtract, combine" editing instructions.

**Consistency:** Keeps character appearance and lighting consistent across edits (a major pain point in DALL-E 3).

**Availability:** ChatGPT: Rolling out today to all users via a new "Images" tab in the sidebar.

**API:** Available immediately as gpt-image-1.5.

**Google held the crown with "Nano Banana Pro" for about a month. With OpenAI claiming "4x speed" and better instruction following, is this the DALL-E 3 successor we were waiting for?**

**Source: OpenAI Blog**

🔗: https://openai.com/index/new-chatgpt-images-is-here/

**Video :** https://youtu.be/DPBtd57p5Mg?si=iBlvJ0Km6uUoltYn

Open Reddit thread
r/ChatGPT 288 upvotes 154 comments December 16, 2025
Introducing ChatGPT Images

Introducing ChatGPT Images, powered by our flagship new image generation model. 

* Stronger instruction following
* Precise editing
* Detail preservation
* 4x faster than before

Rolling out today in ChatGPT for all users, and in the API as GPT-Image-1.5.

[https://openai.com/index/new-chatgpt-images-is-here/](https://openai.com/index/new-chatgpt-images-is-here/)

Open Reddit thread
r/singularity 365 upvotes 128 comments December 9, 2025
GPT-IMAGE-2 possibly in LMArena under the name "Hazel-gen"

The model seems very good compared to GPT-IMAGE-1, claims to be from openai, so it's fair to think this is the long awaited GPT-IMAGE-2.

Image prompt - "a table with an analogue clock that read 7:24 and a glass of wine with the wine completely full to the brim"

it's reads about 7:26 so close enough

Edit - I agree with you guys that style wise it isn't very good, however the clock face and full wine glass is a good test that it basically passes, plus the text rendering is good, try it out yourself!

Open Reddit thread
View more discussions →
FAQ

Common questions about GPT Image 1

What is the context window for GPT Image 1?

GPT Image 1 has a context window of 4,000 tokens, which governs the length of text prompt input it can process per request.

What input types does GPT Image 1 accept?

The model accepts text prompts along with arrays of image URLs, allowing both pure text-to-image generation and image editing workflows.

What output resolutions does GPT Image 1 support?

The model supports three output sizes: square at 1024×1024, portrait at 1024×1536, and landscape at 1536×1024 pixels.

When was GPT Image 1 released?

GPT Image 1 was released by OpenAI in April 2025.

Can GPT Image 1 render readable text inside generated images?

Yes. GPT Image 1 is designed to render legible text within images, which makes it suitable for generating materials like infographics, product labels, and slides that require accurate copy.

More models from OpenAI

Continue browsing adjacent models from the same provider.

← All AI Models