Join Anthropic's Applied AI team for a session on evaluating AI agents.
Most teams shipping AI agents can't tell whether a new model actually improves their product.The hard part is that the products people are shipping now aren't single prompts. They're agents that call tools, retrieve context, and take several steps before a customer sees output, with a different way to fail at each step. We'll show how the teams we work with evaluate agents end to end, decide quickly whether a new model is worth switching to, and stay confident their product performs at the level customers expect.
You'll see real examples from startups building on Claude and leave with something you can put into practice the same week.

Applied AI @ Anthropic

Applied AI @ Anthropic
By submitting, you acknowledge the Anthropic Privacy Policy.
The recording of this webinar is not available yet.
What’s next:
What’s next:
Have questions? Feel free to contact us at partner-marketing@anthropic.com. We look forward to seeing you there!