Anthropic

Claude 4 Sonnet

Claude Sonnet 4 (claude-sonnet-4-20250514) is a text generation model developed by Anthropic and released on May 22, 2025. It sits in the mid-tier of Anthropic's Claude 4 model family, designed to balance capability with computational efficiency for production use. The model supports a 200,000-token context window and accepts text, images, and PDFs as input. It includes an optional extended thinking mode that allows the model to perform step-by-step reasoning when tasks require greater depth. Claude Sonnet 4 is built for high-volume workloads where consistent performance and reliability matter. It scores 72.7% on SWE-bench, reflecting strong performance on software engineering tasks such as code generation, debugging, and codebase navigation. The model also supports agentic tool use, making it suitable for multi-step workflows and integration with external APIs. Common use cases include code review, customer support automation, data analysis, and long-document processing.

May 22, 2025 200,000 context 64,000 tokens output
Hybrid Reasoning Code Generation Large Context Window Vision & Multimodal Input Agentic Tool Use Configurable Parameters

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Anthropic

Model ID

The routed model identifier exposed by upstream providers.

anthropic/claude-sonnet-4

Input Context Window

The number of tokens supported by the input context window.

200,000 tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

64,000 tokens tokens

Open Source

Whether the model's code is available for public use.

No

Release Date

When the model was first released.

May 22, 2025 1 year ago

Knowledge Cut-off Date

When the model's knowledge was last updated.

2025-01-31

API Providers

The providers that offer this model. This is not an exhaustive list.

Google, Amazon Bedrock, Anthropic

Modalities

Types of data this model can process.

Text Image Code File

What is Claude 4 Sonnet

A fuller summary of positioning, capabilities, and source-specific details for Claude 4 Sonnet.

Claude Sonnet 4 (claude-sonnet-4-20250514) is a text generation model developed by Anthropic and released on May 22, 2025. It sits in the mid-tier of Anthropic's Claude 4 model family, designed to balance capability with computational efficiency for production use. The model supports a 200,000-token context window and accepts text, images, and PDFs as input. It includes an optional extended thinking mode that allows the model to perform step-by-step reasoning when tasks require greater depth.

Claude Sonnet 4 is built for high-volume workloads where consistent performance and reliability matter. It scores 72.7% on SWE-bench, reflecting strong performance on software engineering tasks such as code generation, debugging, and codebase navigation. The model also supports agentic tool use, making it suitable for multi-step workflows and integration with external APIs. Common use cases include code review, customer support automation, data analysis, and long-document processing.

Capabilities

What Claude 4 Sonnet supports

RN

Hybrid Reasoning

Supports both standard fast responses and an optional extended thinking mode that works through problems step by step, adapting reasoning depth to task complexity.

</>

Code Generation

Handles code generation, debugging, refactoring, and autonomous codebase navigation, achieving a 72.7% score on the SWE-bench benchmark.

CTX

Large Context Window

Processes up to 200,000 tokens in a single context, enabling analysis of large codebases, lengthy documents, and extended multi-turn conversations.

MM

Vision & Multimodal Input

Accepts images and PDFs alongside text, allowing the model to analyze visual content and documents within the same request.

AG

Agentic Tool Use

Supports tool calling and multi-step workflows, enabling integration with external APIs and use as a subagent in complex AI pipelines.

AI

Configurable Parameters

Exposes numeric configuration options such as temperature and token limits, giving developers fine-grained control over output behavior.

Pricing for Claude 4 Sonnet

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

Web search $10000.00
Cache read $0.30
Cache write $3.75
maxTemperature 1
maxResponseSize 64,000 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Google Amazon Bedrock Anthropic

Provider Endpoints

Endpoint-level provider data currently available for this model.

Google

Max output: 64,000 1d uptime: 100.0% Supported params: 8 Implicit caching: No

Amazon Bedrock

Max output: 64,000 Supported params: 9 Implicit caching: No

Anthropic

Max output: 64,000 1d uptime: 99.9% Supported params: 8 Implicit caching: No

Amazon Bedrock

Max output: 64,000 1d uptime: 99.9% Supported params: 9 Implicit caching: No

Google

Max output: 64,000 Supported params: 8 Implicit caching: No

Google

Max output: 64,000 1d uptime: 96.6% Supported params: 8 Implicit caching: No

Configuration & Parameters

The configurable options currently documented for this model.

Reasoning

Select

When enabled, the model will explain its thought process step-by-step before providing a final answer. This can help users understand how the model arrived at its conclusions, but may result in longer responses.

Default: false
Disabled Enabled

Max Reasoning Size

Number

You can allocate a larger thinking budget to support more thorough reasoning. Must be less than max. response size

Range: 1024 - 32000

Supported Request Parameters

Parameters currently listed by OpenRouter or the local catalog for this model.

Reasoning Max Reasoning Size

Model Performance

Benchmark scores synced from the current model source and normalized into the local catalog.

Benchmark Score
AIME 2024
American math olympiad problems
40.7%
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
68.3%
HLE
Questions that challenge frontier models across many domains
4.0%
LiveCodeBench
Real-world coding tasks from recent competitions
44.9%
MATH-500
Undergraduate and competition-level math problems
93.4%
MMLU-Pro
Expert knowledge across 14 academic disciplines
83.7%
SciCode
Scientific research coding and numerical methods
37.3%

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Compare Claude 4 Sonnet with related models

Jump straight into the most relevant side-by-side comparison pages for this model.

Claude 4 Opus vs Claude 4 Sonnet

Compare Claude 4 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for reasoning-heavy tasks versus reasoning-heavy tasks.

Claude 4.8 Opus vs Claude 4 Sonnet

Compare Claude 4.8 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for reasoning-heavy tasks versus reasoning-heavy tasks.

Claude 4.7 Opus vs Claude 4 Sonnet

Compare Claude 4.7 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus reasoning-heavy tasks.

Claude 4.6 Sonnet vs Claude 4 Sonnet

Compare Claude 4.6 Sonnet and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus reasoning-heavy tasks.

Claude 4.6 Opus vs Claude 4 Sonnet

Compare Claude 4.6 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus reasoning-heavy tasks.

Claude 4.5 Sonnet vs Claude 4 Sonnet

Compare Claude 4.5 Sonnet and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for reasoning-heavy tasks versus reasoning-heavy tasks.

Community discussion

What people think about Claude 4 Sonnet

Claude 4 Sonnet discussions are most active in r/ClaudeAI, r/singularity, r/LocalLLaMA.

Top Reddit threads cluster around benchmark and model-comparison threads, safety and censorship questions, coding workflow discussions. The strongest match in this snapshot has 1892 upvotes and 443 comments.

r/ClaudeAI 1,892 upvotes 443 comments September 29, 2025
Introducing Claude Sonnet 4.5

https://preview.redd.it/lm1pxnzzl4sf1.png?width=2160&format=png&auto=webp&s=fe15e1db93ef31b6d39bf959715f67701ede2271

Introducing Claude Sonnet 4.5—the best coding model in the world. 

It's the strongest model for building complex agents, the best model for computer use, and it shows substantial gains on tests of reasoning and math.

We're also introducing upgrades across all Claude surfaces

**Claude Code**

* The terminal interface has a fresh new look
* The new VS Code extension brings Claude to your IDE. 
* The new checkpoints feature lets you confidently run large tasks and roll back instantly to a previous state, if needed

**Claude App**: 

* Claude can use code to analyze data, create files, and visualize insights in the files & formats you use. Now available to all paid plans in preview. 
* The Claude for Chrome extension is now available to everyone who joined the waitlist last month

**Claude Developer Platform**: 

* Run agents longer by automatically clearing stale context and using our new memory tool to store and consult more information.
* The Claude Agent SDK gives you access to the same core tools, context management systems, and permissions frameworks that power Claude Code

We're also releasing a temporary research preview called **"Imagine with Claude"**

* In this experiment, Claude generates software on the fly. No functionality is predetermined; no code is prewritten.
* Available to Max users for 5 days. [Try it out](https://claude.ai/imagine)

Claude Sonnet 4.5 is available everywhere today—on the Claude app and Claude Code, the Claude Developer Platform, natively and in Amazon Bedrock and Google Cloud's Vertex AI.

Pricing remains the same as Sonnet 4.

[Read the full announcement](https://www.anthropic.com/news/claude-sonnet-4-5)

Open Reddit thread
r/ClaudeAI 1,222 upvotes 229 comments February 17, 2026
This is Claude Sonnet 4.6: our most capable Sonnet model yet.

Claude Sonnet 4.6 is a full upgrade across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. It also features a 1M token context window in beta.

Sonnet 4.6 has improved on benchmarks across the board. It approaches Opus-level intelligence at a price point that makes it practical for far more tasks.

It also shows a major improvement in computer use skills. Early users are seeing human-level capability in tasks like navigating a complex spreadsheet or filling out a multi-step web form.

Claude Sonnet 4.6 is available now on all plans, Cowork, Claude Code, our API, and all major cloud platforms. We've also upgraded our free tier to Sonnet 4.6 by default.

Learn more: [anthropic.com/news/claude-sonnet-4-6](http://anthropic.com/news/claude-sonnet-4-6)

Open Reddit thread
r/cursor 309 upvotes 177 comments July 3, 2025
Cursor 1.2 and Claude 4 Sonnet Rate Limit - Is This a Joke?

I’ve been using Cursor for a few months now, and honestly, I’m at my wit’s end. I just updated to version 1.2, and after **only three prompts** with Claude 4 Sonnet, I’m hit with the rate limit window. Three prompts! And suddenly, I can’t code with an AI agent anymore. This is beyond frustrating - I’ve paid $20 a month for the past four months, and this is what I get? It feels like a scam at this point.

What’s even more annoying is that Cursor advertises “unlimited” access to Claude 4 Sonnet for Pro users, but in reality, it’s anything but unlimited. I’ve seen posts where people are getting rate-limited after minimal usage, and some are even being forced to switch to the “Auto” option once their usage cap is reached. This is a huge downgrade from what was promised, and it’s making me question whether Cursor is even worth the subscription anymore.

I’m also keeping an eye on Grok 4 - Code, which might be another alternative worth exploring. Anything has to be better than dealing with Cursor’s constant rate limit issues and feeling like I’m throwing money away.

Has anyone else experienced this with Cursor 1.2? What are your thoughts on switching to Claude Code or other alternatives?

Open Reddit thread
View more discussions →
FAQ

Common questions about Claude 4 Sonnet

What is the context window size for Claude Sonnet 4?

Claude Sonnet 4 supports a context window of 200,000 tokens, which allows it to process large documents, long conversation histories, and extensive codebases in a single request.

What is the knowledge cutoff date for Claude Sonnet 4?

The model's training data has a cutoff of May 2025, as indicated in the model metadata.

What input types does Claude Sonnet 4 support?

Claude Sonnet 4 accepts text, images, and PDFs as input, making it capable of handling both natural language and visual or document-based content.

Does Claude Sonnet 4 support tool use and agentic workflows?

Yes. Claude Sonnet 4 is built to support tool calling, multi-step task execution, and integration with external APIs, making it suitable for use as an agent or subagent in automated pipelines.

What is the model identifier for Claude Sonnet 4?

The full model identifier is claude-sonnet-4-20250514. This is the version string used when calling the model via Anthropic's API.

More models from Anthropic

Continue browsing adjacent models from the same provider.

← All AI Models