OpenAI

GPT 5.4

GPT-5.4 is a text generation model developed by OpenAI, released in March 2026 as their flagship model for professional and enterprise use. It is available in three variants — standard, Thinking, and Pro — and features a context window of 1 million tokens, the largest OpenAI has offered. The model is designed not only to plan complex tasks but to complete them reliably, with built-in computer use capabilities for orchestrating multi-step agentic workflows. GPT-5.4 is best suited for enterprise teams running AI in production environments, including customer support automation, document drafting, data analysis, and developer workflows. It recorded an 83% score on GDPval for knowledge work tasks and ranked second out of 116 models on the Artificial Analysis Intelligence Index. The Pro variant adds multi-path reasoning evaluation for scenarios where analytical depth is prioritized over speed, such as scientific research and complex decision-making.

Mar 05, 2026 1050K context 128,000 tokens output

Agentic Workflows 1M Token Context Extended Reasoning Artifact Generation Reduced Hallucinations Token-Efficient Output

Overview ↓ About ↓ Capabilities ↓ Pricing ↓ Price Comparison ↓ Providers ↓ Benchmarks ↓ Compare ↓ Tools ↓ Resources ↓ Community ↓ FAQ ↓

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

OpenAI

Model ID

The routed model identifier exposed by upstream providers.

openai/gpt-5.4

Input Context Window

The number of tokens supported by the input context window.

1050K tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

128,000 tokens tokens

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

Mar 05, 2026 3 months ago

Knowledge Cut-off Date

When the model's knowledge was last updated.

March 2026

API Providers

The providers that offer this model. This is not an exhaustive list.

OpenAI, Azure

Modalities

Types of data this model can process.

Text Image File

What is GPT 5.4

A fuller summary of positioning, capabilities, and source-specific details for GPT 5.4.

GPT-5.4 is a text generation model developed by OpenAI, released in March 2026 as their flagship model for professional and enterprise use. It is available in three variants — standard, Thinking, and Pro — and features a context window of 1 million tokens, the largest OpenAI has offered. The model is designed not only to plan complex tasks but to complete them reliably, with built-in computer use capabilities for orchestrating multi-step agentic workflows.

GPT-5.4 is best suited for enterprise teams running AI in production environments, including customer support automation, document drafting, data analysis, and developer workflows. It recorded an 83% score on GDPval for knowledge work tasks and ranked second out of 116 models on the Artificial Analysis Intelligence Index. The Pro variant adds multi-path reasoning evaluation for scenarios where analytical depth is prioritized over speed, such as scientific research and complex decision-making.

Capabilities

What GPT 5.4 supports

Agentic Workflows

Executes multi-step tasks autonomously using built-in computer use capabilities, including tool orchestration, file access, and data extraction with minimal human oversight.

CTX

1M Token Context

Supports a context window of up to 1 million tokens, enabling processing of extensive documents, large codebases, and long multi-turn sessions in a single request.

Extended Reasoning

The Thinking variant applies enhanced logical follow-through across long, complex interactions, maintaining consistency over extended reasoning chains.

Artifact Generation

Produces structured professional outputs including documents, spreadsheets, slide decks, financial models, and legal analyses in a single session.

Reduced Hallucinations

Delivers 33% fewer factual errors in individual claims compared to GPT-5.2, according to OpenAI's internal benchmarks.

Token-Efficient Output

Solves problems using fewer tokens than its predecessor, reducing latency and cost for high-volume production workloads.

</>

Code Generation

Generates, reviews, and debugs code across common programming languages, with support for developer workflows within the full 1M token context.

Deep Analytical Reasoning

The Pro variant uses multi-path reasoning evaluation to provide greater analytical depth for research, legal analysis, and complex decision-making tasks.

Pricing for GPT 5.4

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Input tokens $2.50 Per million tokens

Output tokens $15.00 Per million tokens

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

Web search $10000.00

Cache read $0.25

maxTemperature 2

maxResponseSize 128,000 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

OpenAI Azure

Provider Endpoints

Endpoint-level provider data currently available for this model.

OpenAI

Max prompt: 922,000 Max output: 128,000 1d uptime: 98.9% Supported params: 8 Implicit caching: No

Azure

Max prompt: 922,000 Max output: 128,000 1d uptime: 99.7% Supported params: 8 Implicit caching: No

Model Performance

Benchmark scores synced from the current model source and normalized into the local catalog.

Benchmark	Score
ARC-AGI-2 Novel abstract reasoning and pattern recognition	73.3%
BrowseComp Complex web browsing and information retrieval	82.7%
GPQA Diamond PhD-level science questions (biology, physics, chemistry)	92.0%
HLE Questions that challenge frontier models across many domains	41.6%
OSWorld-Verified Autonomous computer use and desktop tasks	75.0%
SciCode Scientific research coding and numerical methods	56.6%
SWE-bench Pro Challenging real-world software engineering tasks	57.7%
Terminal-Bench 2.0 Agentic coding and terminal command tasks	75.1%

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

TechCrunch Launch Article Announcements

→

Microsoft Foundry Announcement Announcements

→

Artificial Analysis Model Profile Other

→

OpenAI Platform Documentation Documentation

→

OpenAI API Reference Documentation

→

OpenAI Playground Playground

→

Official Website

→

Usage Policies

→

Enterprise privacy at OpenAI

→

OpenAI Status Page

→

OpenRouter Model Page OpenRouter

→

Compare GPT 5.4 with related models

Jump straight into the most relevant side-by-side comparison pages for this model.

GPT 5.4 vs GPT 5.4 Pro

Compare GPT 5.4 and GPT 5.4 Pro across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus long-context workloads.

GPT 5.5 vs GPT 5.4

Compare GPT 5.5 and GPT 5.4 across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus long-context workloads.

AI tools related to GPT 5.4

These tools are strongly connected to GPT 5.4 through direct product references, provider mentions, or explicit model mappings.

AI Assistant

MaxAI.me

MaxAI.me is a Chrome and Edge extension designed to boost productivity by offering one-click AI tools for summarizing, searching, explaining, analyzing, translating, and writing content across any website. It supports major AI providers, including ChatGPT, Google Bard, Bing Chat AI, and Claude, and integrates with ChatGPT Plus features like GPT-4, Web Browsing, Code Interpreter, and Plugins. Users can also utilize their own OpenAI API key to access models such as GPT-4, GPT-3.5-turbo-16k, and GPT-4-32k. Additionally, the extension provides one-click ChatGPT prompts tailored for marketing, sales, copywriting, operations, productivity, and customer support.

Free 0 visits 5 saves

AI Chatbot

ChatGPT Phantom: Lofi Tutor

ChatGPT Phantom: Lofi Tutor is a Chrome extension that integrates AI models, including ChatGPT, Bing Chat, and Google Bard, to support writing and coding tasks. By leveraging real-time data—specifically from YouTube—it provides an advanced search experience for generating customized news articles and video scripts, serving as an alternative to traditional search engines.

Free 0 visits 4 saves

AI Assistant

Powerly.ai

Powerly.ai is a no-code platform designed for building custom ChatGPT-powered chatbots. It provides white-label solutions that allow users to create branded AI assistants for customer support, sales, and content generation. Users can integrate their own OpenAI API keys, train bots on custom data, utilize interactive video guides, and embed unlimited chatbots into websites and mobile applications.

Free 0 visits 1 saves

AI Assistant

GPT Omni

GPT Omni (gptomni.ai) offers a free, accessible web interface for interacting with the GPT-4o model. Designed for ease of use, it allows users to engage in AI conversations without technical requirements. By leveraging OpenAI's GPT-4o, the platform supports text, audio, and visual inputs, providing real-time audio responses, improved multilingual capabilities, and advanced vision features to make AI technology widely available.

Free 0 visits 7 saves

Community discussion

What people think about GPT 5.4

GPT 5.4 discussions are most active in r/singularity, r/codex, r/OpenAI. Top Reddit threads cluster around benchmark and model-comparison threads, coding workflow discussions.

The strongest match in this snapshot has 13105 upvotes and 912 comments.

r/codex 5 upvotes 14 comments April 29, 2026

Did GPT 5.4 get dumber or is GPT 5.5 just a lot better?

I've been using GPT 5.4 high (extra high on a few occasions) for planning and reviewing code. (I use GPT 5.4-mini for implementing the plans from 5.4). It's been great. Last week, I tried to resolve an issue with a home screen widget not displaying correctly on IOS. I tried twice with GPT 5.4 high. It couldn't fix the issue. I decided to give GPT 5.5 a try for the first time. It resolve the issue in one shot, it was pretty incredible.

However, in the past couple of days, I've noticed GPT 5.4 makes silly mistakes for example, it doesn't include tests for critical functions, for unit tests it doesn't mock correctly, some of the changes it proposes leads to build failures, etc. It didn't make mistakes like this before. This has caused me to start using 5.5 more often than I would like because of how expensive it is.

Am I the only one experiencing this?

Open Reddit thread

r/codex 59 upvotes 19 comments April 19, 2026

GPT-5.4 mini is a godsend

I'm on the $20 plan and was really struggling with the regular GPT-5.4 model. I exhausted my 5h limit within 1h and my weekly limit within 2-3 days.

But with mini I have yet to hit my 5h limit before it runs out! I'm currently mostly adding new features and debugging and not creating a code base from scratch. But even then it might be good enough if you work in small increments.

Open Reddit thread

r/OpenAI 9 upvotes 30 comments March 20, 2026

GPT-5.4 Nano is genuinely impressive, how’s your experience?

I’ve been using GPT-5.4 Nano and I’m honestly blown away by how well it performs for being a smaller model. The speed feels great, and the output quality has been consistently strong for tasks I normally use larger models for.

What I’m curious about:

* What kinds of prompts/workflows are you getting the best results with?
* How does it compare to models you were using before (quality, latency, reliability)?
* Any “best practices” you’ve found, prompt style, system instructions, or tool usage, that really improve results?

Would love to hear your experience and any tips.

Open Reddit thread

r/OpenAI 2 upvotes 13 comments April 8, 2026

Can we talk about GPT 5.4 Mini for a second?

The price-to-performance ratio is actually insane. It’s a total powerhouse for next to nothing, yet everyone is still busy glazing Claude??

Make it make sense.

Open Reddit thread

r/GithubCopilot 57 upvotes 15 comments April 3, 2026

gpt 5.4 mini is EXTREMELY request efficient

I use gpt 5.3 codex for the research/plan phase and use 5.4 mini to execute. it will use like .5% max even for huge refactors/changes

in terms of planning it is kinda dumb even on high reasoning so use a different model for it. but with a detailed plan, it is REALLY good for execution. quite fast as well

Open Reddit thread

View more discussions →

FAQ

Common questions about GPT 5.4

What is the context window for GPT-5.4?

GPT-5.4 supports a context window of up to 1 million tokens, which allows it to process large documents, codebases, and extended multi-step workflows within a single session.

What are the differences between GPT-5.4, GPT-5.4 Thinking, and GPT-5.4 Pro?

The standard GPT-5.4 is designed for general professional and enterprise use. GPT-5.4 Thinking is optimized for tasks requiring enhanced logical reasoning across long interactions. GPT-5.4 Pro adds multi-path reasoning evaluation and greater analytical depth, making it suited for scientific research and complex decision-making where thoroughness is prioritized over speed.

What is the training data cutoff for GPT-5.4?

According to the available metadata, GPT-5.4 has a training date of March 2026. A more specific knowledge cutoff date has not been confirmed in the provided metadata.

What benchmarks has GPT-5.4 been evaluated on?

GPT-5.4 has been evaluated on OSWorld-Verified and WebArena Verified for computer use tasks, GDPval where it scored 83% for knowledge work, and Mercor's APEX-Agents benchmark for professional skills in law and finance. It ranks second out of 116 models on the Artificial Analysis Intelligence Index.

What types of tasks is GPT-5.4 best suited for?

GPT-5.4 is designed for enterprise production environments and is well-suited for customer support automation, document drafting, data analysis, developer workflows, agentic task execution, and extended reasoning tasks. The Pro variant is additionally suited for scientific research and scenarios requiring deep analytical work.

More models from OpenAI

Continue browsing adjacent models from the same provider.

← All AI Models