Hybrid Reasoning
Supports both standard fast responses and an optional extended thinking mode that works through problems step by step, adapting reasoning depth to task complexity.
Claude Sonnet 4 (claude-sonnet-4-20250514) is a text generation model developed by Anthropic and released on May 22, 2025. It sits in the mid-tier of Anthropic's Claude 4 model family, designed to balance capability with computational efficiency for production use. The model supports a 200,000-token context window and accepts text, images, and PDFs as input. It includes an optional extended thinking mode that allows the model to perform step-by-step reasoning when tasks require greater depth. Claude Sonnet 4 is built for high-volume workloads where consistent performance and reliability matter. It scores 72.7% on SWE-bench, reflecting strong performance on software engineering tasks such as code generation, debugging, and codebase navigation. The model also supports agentic tool use, making it suitable for multi-step workflows and integration with external APIs. Common use cases include code review, customer support automation, data analysis, and long-document processing.
High-signal model metadata in a structured two-column overview table.
The entity that provides this model.
The routed model identifier exposed by upstream providers.
The number of tokens supported by the input context window.
The number of tokens that can be generated by the model in a single request.
Whether the model's code is available for public use.
When the model was first released.
When the model's knowledge was last updated.
The providers that offer this model. This is not an exhaustive list.
Types of data this model can process.
A fuller summary of positioning, capabilities, and source-specific details for Claude 4 Sonnet.
Claude Sonnet 4 (claude-sonnet-4-20250514) is a text generation model developed by Anthropic and released on May 22, 2025. It sits in the mid-tier of Anthropic's Claude 4 model family, designed to balance capability with computational efficiency for production use. The model supports a 200,000-token context window and accepts text, images, and PDFs as input. It includes an optional extended thinking mode that allows the model to perform step-by-step reasoning when tasks require greater depth.
Claude Sonnet 4 is built for high-volume workloads where consistent performance and reliability matter. It scores 72.7% on SWE-bench, reflecting strong performance on software engineering tasks such as code generation, debugging, and codebase navigation. The model also supports agentic tool use, making it suitable for multi-step workflows and integration with external APIs. Common use cases include code review, customer support automation, data analysis, and long-document processing.
Supports both standard fast responses and an optional extended thinking mode that works through problems step by step, adapting reasoning depth to task complexity.
Handles code generation, debugging, refactoring, and autonomous codebase navigation, achieving a 72.7% score on the SWE-bench benchmark.
Processes up to 200,000 tokens in a single context, enabling analysis of large codebases, lengthy documents, and extended multi-turn conversations.
Accepts images and PDFs alongside text, allowing the model to analyze visual content and documents within the same request.
Supports tool calling and multi-step workflows, enabling integration with external APIs and use as a subagent in complex AI pipelines.
Exposes numeric configuration options such as temperature and token limits, giving developers fine-grained control over output behavior.
Primary API pricing shown in the same “quick compare” spirit as the reference page.
Additional usage-cost dimensions synced into the project for this model.
Places where this model is available, based on the synced detail-page metadata.
Endpoint-level provider data currently available for this model.
The configurable options currently documented for this model.
When enabled, the model will explain its thought process step-by-step before providing a final answer. This can help users understand how the model arrived at its conclusions, but may result in longer responses.
You can allocate a larger thinking budget to support more thorough reasoning. Must be less than max. response size
Parameters currently listed by OpenRouter or the local catalog for this model.
Benchmark scores synced from the current model source and normalized into the local catalog.
| Benchmark | Score |
|---|---|
|
AIME 2024
American math olympiad problems
|
|
|
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
|
|
|
HLE
Questions that challenge frontier models across many domains
|
|
|
LiveCodeBench
Real-world coding tasks from recent competitions
|
|
|
MATH-500
Undergraduate and competition-level math problems
|
|
|
MMLU-Pro
Expert knowledge across 14 academic disciplines
|
|
|
SciCode
Scientific research coding and numerical methods
|
Official model cards, release notes, docs, and other references synced from the source page.
Jump straight into the most relevant side-by-side comparison pages for this model.
Compare Claude 4 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for reasoning-heavy tasks versus reasoning-heavy tasks.
Compare Claude 4.8 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for reasoning-heavy tasks versus reasoning-heavy tasks.
Compare Claude 4.7 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus reasoning-heavy tasks.
Compare Claude 4.6 Sonnet and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus reasoning-heavy tasks.
Compare Claude 4.6 Opus and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for long-context workloads versus reasoning-heavy tasks.
Compare Claude 4.5 Sonnet and Claude 4 Sonnet across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for reasoning-heavy tasks versus reasoning-heavy tasks.
Claude 4 Sonnet discussions are most active in r/ClaudeAI, r/singularity, r/LocalLLaMA.
Top Reddit threads cluster around benchmark and model-comparison threads, safety and censorship questions, coding workflow discussions. The strongest match in this snapshot has 1892 upvotes and 443 comments.
https://preview.redd.it/lm1pxnzzl4sf1.png?width=2160&format=png&auto=webp&s=fe15e1db93ef31b6d39bf959715f67701ede2271
Introducing Claude Sonnet 4.5—the best coding model in the world.
It's the strongest model for building complex agents, the best model for computer use, and it shows substantial gains on tests of reasoning and math.
We're also introducing upgrades across all Claude surfaces
**Claude Code**
* The terminal interface has a fresh new look
* The new VS Code extension brings Claude to your IDE.
* The new checkpoints feature lets you confidently run large tasks and roll back instantly to a previous state, if needed
**Claude App**:
* Claude can use code to analyze data, create files, and visualize insights in the files & formats you use. Now available to all paid plans in preview.
* The Claude for Chrome extension is now available to everyone who joined the waitlist last month
**Claude Developer Platform**:
* Run agents longer by automatically clearing stale context and using our new memory tool to store and consult more information.
* The Claude Agent SDK gives you access to the same core tools, context management systems, and permissions frameworks that power Claude Code
We're also releasing a temporary research preview called **"Imagine with Claude"**
* In this experiment, Claude generates software on the fly. No functionality is predetermined; no code is prewritten.
* Available to Max users for 5 days. [Try it out](https://claude.ai/imagine)
Claude Sonnet 4.5 is available everywhere today—on the Claude app and Claude Code, the Claude Developer Platform, natively and in Amazon Bedrock and Google Cloud's Vertex AI.
Pricing remains the same as Sonnet 4.
[Read the full announcement](https://www.anthropic.com/news/claude-sonnet-4-5)
Claude Sonnet 4.6 is a full upgrade across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. It also features a 1M token context window in beta.
Sonnet 4.6 has improved on benchmarks across the board. It approaches Opus-level intelligence at a price point that makes it practical for far more tasks.
It also shows a major improvement in computer use skills. Early users are seeing human-level capability in tasks like navigating a complex spreadsheet or filling out a multi-step web form.
Claude Sonnet 4.6 is available now on all plans, Cowork, Claude Code, our API, and all major cloud platforms. We've also upgraded our free tier to Sonnet 4.6 by default.
Learn more: [anthropic.com/news/claude-sonnet-4-6](http://anthropic.com/news/claude-sonnet-4-6)
[https://www.anthropic.com/news/claude-sonnet-4-5](https://www.anthropic.com/news/claude-sonnet-4-5)
https://www.an.com/news/claude-sonnet-4-5
I’ve been using Cursor for a few months now, and honestly, I’m at my wit’s end. I just updated to version 1.2, and after **only three prompts** with Claude 4 Sonnet, I’m hit with the rate limit window. Three prompts! And suddenly, I can’t code with an AI agent anymore. This is beyond frustrating - I’ve paid $20 a month for the past four months, and this is what I get? It feels like a scam at this point.
What’s even more annoying is that Cursor advertises “unlimited” access to Claude 4 Sonnet for Pro users, but in reality, it’s anything but unlimited. I’ve seen posts where people are getting rate-limited after minimal usage, and some are even being forced to switch to the “Auto” option once their usage cap is reached. This is a huge downgrade from what was promised, and it’s making me question whether Cursor is even worth the subscription anymore.
I’m also keeping an eye on Grok 4 - Code, which might be another alternative worth exploring. Anything has to be better than dealing with Cursor’s constant rate limit issues and feeling like I’m throwing money away.
Has anyone else experienced this with Cursor 1.2? What are your thoughts on switching to Claude Code or other alternatives?
Claude Sonnet 4 supports a context window of 200,000 tokens, which allows it to process large documents, long conversation histories, and extensive codebases in a single request.
The model's training data has a cutoff of May 2025, as indicated in the model metadata.
Claude Sonnet 4 accepts text, images, and PDFs as input, making it capable of handling both natural language and visual or document-based content.
Yes. Claude Sonnet 4 is built to support tool calling, multi-step task execution, and integration with external APIs, making it suitable for use as an agent or subagent in automated pipelines.
The full model identifier is claude-sonnet-4-20250514. This is the version string used when calling the model via Anthropic's API.
Continue browsing adjacent models from the same provider.