Claude Sonnet 4
Hybrid reasoning model with superior intelligence for high-volume use cases, and 200K context window
Announcements
- New
Claude Sonnet 4
Claude Sonnet 4 improves on Claude Sonnet 3.7 across a variety of areas, especially coding. It offers frontier performance that’s practical for most AI use cases, including user-facing AI assistants and high-volume tasks.
Read more
Claude Sonnet 3.7 and Claude Code
Feb 24, 2025
Claude Sonnet 3.7 is the first hybrid reasoning model and our most intelligent model to date. It’s state-of-the art for coding and delivers significant improvements in content generation, data analysis, and planning.
Read more
Availability and pricing
For business users and consumers who want to collaborate with Claude Sonnet 4 using a powerful chat experience, Claude Sonnet 4 is available on Claude for all users across the web, iOS, and Android.
For developers interested in building custom AI solutions with Claude Sonnet 4, it is available on the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI.
Pricing for Claude Sonnet 4 starts at $3 per million input tokens and $15 per million output tokens, with up to 90% cost savings with prompt caching and 50% cost savings with batch processing. To learn more, check out our pricing page.
Use cases
Claude Sonnet 4 can understand nuanced instructions and context, recognize and correct its own mistakes, and create sophisticated analysis and insights from complex data. Combined with superior coding, vision, and writing skills, you can use Claude Sonnet 4 for a variety of use cases.
Claude Sonnet 4 can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. API users also have fine-grained control over how long the model thinks for. Popular use cases include:
Customer-facing AI agents
Claude Sonnet 4 offers superior instruction following, tool selection, error correction, and advanced reasoning for customer-facing agents and complex AI workflows.
Code generation
Claude Sonnet 4 is a powerful choice for agentic coding, and can complete tasks across the entire software development lifecycle—from initial planning to bug fixes, maintenance to large refactors. It offers strong performance in both planning and solving for complex coding tasks, making it an ideal choice to power end-to-end software development processes.
Claude Sonnet 4 supports up to 64K output tokens, which is particularly valuable for rich code generation and planning.
Computer use
By integrating Claude via API, developers can direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking buttons, and typing text. Claude 3.5 Sonnet was the first frontier AI model to be able to use computers in this way. Claude Sonnet 4 is an even more accurate model to reliably use computers in this way and we expect the capability to improve over time.
Advanced chatbots
With enhanced reasoning and a warm, human-like tone, Claude Sonnet 4 is ideal for chatbots that need to connect data and take action across a variety of systems and tools.
Knowledge Q&A
Claude Sonnet 4 offers a large context window and low rates of hallucination, making it ideal for answering questions around large knowledge bases, documents, and codebases.
Visual data extraction
Claude Sonnet 4 is able to extract information from visuals like charts, graphs, and complex diagrams with ease—making it an ideal AI model for data analytics and data science tasks.
Content generation and analysis
Claude Sonnet 4 excels at writing and is able to understand nuance and tone to generate more compelling content and analyze content on a deeper level.
Robotic process automation
Automate repetitive tasks or processes with Claude Sonnet 4. It offers industry-leading instruction following and is capable of handling complex processes and operations.
Benchmarks
Claude Sonnet 4 delivers superior intelligence across coding, agentic search, and AI agent capabilities.
Claude Sonnet 4 achieves strong performance across SWE-bench for coding, TAU-bench for agentic tool use, and more across traditional and agentic benchmarks.
Trust & Safety
We've conducted extensive testing and evaluation of Claude Sonnet 4, working with external experts to ensure it meets our standards for safety, security and reliability. In the model card for this release, we discuss new safety results in several categories.
What customers are saying
See Claude in action
Coding
What should I look for when reviewing a Pull Request for a Python web app?
Writing
Create a 3-month editorial calendar template for a weekly newsletter
Students
What's an effective study schedule template for final exams?
Frequently asked questions
We offer a family of Claude models across the spectrum of speed, price, and performance. Claude Sonnet 4 delivers superior intelligence with optimal efficiency for high-volume use cases. We recommend Claude Sonnet 4 for most AI applications where you need a balance of advanced capabilities and practical throughput—such as customer-facing agents, production coding workflows, content generation at scale, and real-time research tasks.
Pricing depends on how you want to use Claude Sonnet 4. To learn more, check out our pricing page.
Claude Sonnet 4 is both a standard model and a hybrid reasoning model in one: you can pick when you want the model to answer normally and when you want it to utilize extended thinking.
Extended thinking mode is best for use cases where performance and accuracy matter more than latency. It significantly improves response quality for complex reasoning tasks, extended agentic work, multi-step coding projects, and deep research—and the thinking summaries help you understand key aspects of the model's reasoning process.