Claude Sonnet 4

Hybrid reasoning model with superior intelligence for high-volume use cases, and 200K context window

Announcements

New
Claude Sonnet 4
Claude Sonnet 4 improves on Claude Sonnet 3.7 across a variety of areas, especially coding. It offers frontier performance that’s practical for most AI use cases, including user-facing AI assistants and high-volume tasks.
Read more
Claude Sonnet 3.7 and Claude Code
Feb 24, 2025
Claude Sonnet 3.7 is the first hybrid reasoning model and our most intelligent model to date. It’s state-of-the art for coding and delivers significant improvements in content generation, data analysis, and planning.
Read more

Availability and pricing

For business users and consumers who want to collaborate with Claude Sonnet 4 using a powerful chat experience, Claude Sonnet 4 is available on Claude for all users across the web, iOS, and Android.

For developers interested in building custom AI solutions with Claude Sonnet 4, it is available on the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI.

Pricing for Claude Sonnet 4 starts at $3 per million input tokens and $15 per million output tokens, with up to 90% cost savings with prompt caching and 50% cost savings with batch processing. To learn more, check out our pricing page.

Use cases

Claude Sonnet 4 can understand nuanced instructions and context, recognize and correct its own mistakes, and create sophisticated analysis and insights from complex data. Combined with superior coding, vision, and writing skills, you can use Claude Sonnet 4 for a variety of use cases.

Claude Sonnet 4 can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. API users also have fine-grained control over how long the model thinks for. Popular use cases include:

Customer-facing AI agents

Claude Sonnet 4 offers superior instruction following, tool selection, error correction, and advanced reasoning for customer-facing agents and complex AI workflows.

Code generation

Claude Sonnet 4 is a powerful choice for agentic coding, and can complete tasks across the entire software development lifecycle—from initial planning to bug fixes, maintenance to large refactors. It offers strong performance in both planning and solving for complex coding tasks, making it an ideal choice to power end-to-end software development processes.

Claude Sonnet 4 supports up to 64K output tokens, which is particularly valuable for rich code generation and planning.

Computer use

By integrating Claude via API, developers can direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking buttons, and typing text. Claude 3.5 Sonnet was the first frontier AI model to be able to use computers in this way. Claude Sonnet 4 is an even more accurate model to reliably use computers in this way and we expect the capability to improve over time.

Advanced chatbots

With enhanced reasoning and a warm, human-like tone, Claude Sonnet 4 is ideal for chatbots that need to connect data and take action across a variety of systems and tools.

Knowledge Q&A

Claude Sonnet 4 offers a large context window and low rates of hallucination, making it ideal for answering questions around large knowledge bases, documents, and codebases.

Visual data extraction

Claude Sonnet 4 is able to extract information from visuals like charts, graphs, and complex diagrams with ease—making it an ideal AI model for data analytics and data science tasks.

Content generation and analysis

Claude Sonnet 4 excels at writing and is able to understand nuance and tone to generate more compelling content and analyze content on a deeper level.

Robotic process automation

Automate repetitive tasks or processes with Claude Sonnet 4. It offers industry-leading instruction following and is capable of handling complex processes and operations.

Benchmarks

Claude Sonnet 4 delivers superior intelligence across coding, agentic search, and AI agent capabilities.

Claude Sonnet 4 achieves strong performance across SWE-bench for coding, TAU-bench for agentic tool use, and more across traditional and agentic benchmarks.

Trust & Safety

We've conducted extensive testing and evaluation of Claude Sonnet 4, working with external experts to ensure it meets our standards for safety, security and reliability. In the model card for this release, we discuss new safety results in several categories.

What customers are saying

Claude Sonnet 4 has soared in agentic scenarios and we're excited to introduce it as the base model for the new coding agent in GitHub Copilot. In early internal evaluations, the model demonstrated up to 10% improvement over the previous Sonnet generation, driven by adaptive tool use, precise instruction-following, and strong coding instincts.

Thomas DohmkeCEO of GitHub

Claude Sonnet 4 improves on Claude Sonnet 3.7 with faster performance and better context understanding that has really impressed us. We're excited to upgrade based on what we've seen in early testing.

Scott WuCo-founder & CEO of Cognition

Claude Opus 4 and Sonnet 4 are state of the art coding models. They're a leap forward in complex codebase understanding, and we expect developers will experience across the board capability improvements.

Michael TruellCo-founder and CEO of Cursor

Claude Sonnet 4's code quality is incredible—it stays on track longer, understands problems deeply, and creates elegant solutions instead of brute-forcing fixes. This shows promise to be a substantial leap in software development!

Beyang LiuCTO of Sourcegraph

Anthropic has again set the gold standard for code-generation models with Claude Opus 4 and Sonnet 4. With a modest shift in prompting, we've seen the new models deliver cleaner, more precise high-quality output.

Jared PalmerVP of AI at Vercel

Claude Sonnet 4's ability to follow complex, multi-step instructions and work through problems with clear chain-of-thought reasoning is remarkable. The aesthetics of the artifacts are really excellent—I've never seen anything like it.

Tao ZhangCofounder of Manus

Claude Sonnet 4 surpasses Sonnet 3.7 with higher success rates, more surgical code edits, and more tightly scoped changes. It works more carefully through complex tasks and delivers superior code quality—making it the ideal choice as the primary coding model in Augment Code.

Guy Gur-AriCo-founder of Augment Code

Claude Sonnet 4 delivers enhanced problem-solving capabilities and substantial improvements in large-scale codebase navigation—typically reducing errors from 20% to near zero. Also, unlike Sonnet 3.7, which increasingly overstates completion as complexity grows, Ombre maintains truthful implementation reporting throughout development—preserving the stable foundations essential for building complex applications autonomously feature-by-feature.

Sean WardCEO of iGent AI

Claude Sonnet 4 sets a new high water mark on our internal evals on generating formulas, and enables the reliable tool use our Claygent needs to power creative, data-driven outreach.

Jeff BargAI Engineering Lead at Clay

See Claude in action

Coding

What should I look for when reviewing a Pull Request for a Python web app?

Ask Claude

Writing

Create a 3-month editorial calendar template for a weekly newsletter

Ask Claude

Students

What's an effective study schedule template for final exams?

Ask Claude

Frequently asked questions

When should I use Claude Sonnet 4?

We offer a family of Claude models across the spectrum of speed, price, and performance. Claude Sonnet 4 delivers superior intelligence with optimal efficiency for high-volume use cases. We recommend Claude Sonnet 4 for most AI applications where you need a balance of advanced capabilities and practical throughput—such as customer-facing agents, production coding workflows, content generation at scale, and real-time research tasks.

How much does it cost to use Claude Sonnet 4?

Pricing depends on how you want to use Claude Sonnet 4. To learn more, check out our pricing page.

When should I use extended thinking?

Claude Sonnet 4 is both a standard model and a hybrid reasoning model in one: you can pick when you want the model to answer normally and when you want it to utilize extended thinking.

Extended thinking mode is best for use cases where performance and accuracy matter more than latency. It significantly improves response quality for complex reasoning tasks, extended agentic work, multi-step coding projects, and deep research—and the thinking summaries help you understand key aspects of the model's reasoning process.