Evals for AI Agents: How Product Builders Get the Most Out of Every New Model

Join Anthropic's Applied AI team for a session on evaluating AI agents.

Most teams shipping AI agents can't tell whether a new model actually improves their product.The hard part is that the products people are shipping now aren't single prompts. They're agents that call tools, retrieve context, and take several steps before a customer sees output, with a different way to fail at each step. We'll show how the teams we work with evaluate agents end to end, decide quickly whether a new model is worth switching to, and stay confident their product performs at the level customers expect.

You'll see real examples from startups building on Claude and leave with something you can put into practice the same week.

Featuring

Preston Tuggle

Applied AI @ Anthropic

Jimmy Chan

Applied AI @ Anthropic

What you’ll learn

Why single-turn evals miss most of what goes wrong in an AI agent, and what to measure instead
How to build a first eval set from real production failures
How to decide quickly whether a new model release is worth adopting for your product

Register now to attend

Register to watch recording

By submitting, you acknowledge the Anthropic Privacy Policy.

Thank you for registering to watch

The recording of this webinar is not available yet.

Thank you for registering to watch

Thank you for registering

We’re excited to have you join us on

July 14, 2026

10:00 am

What’s next:

You'll receive a calendar invite with the webinar link within the next few minutes
A reminder email will be sent 24 hours before the event
All attendees will receive a recording link within 48 hours after the webinar

What’s next:

You'll receive a calendar invite with the webinar link within the next few minutes
A reminder email will be sent 24 hours before the event
All attendees will receive a recording link within 48 hours after the webinar

Have questions? Feel free to contact us at partner-marketing@anthropic.com. We look forward to seeing you there!