InterpretabilityResearch

A Mathematical Framework for Transformer Circuits

Dec 22, 2021
Read Paper


Related content

Project Glasswing: An initial update

An early update on what we've learned from Project Glasswing.

Read more

2028: Two scenarios for global AI leadership

Our views on the AI competition between the US and China.

Read more

Teaching Claude why

New research on how we've reduced agentic misalignment.

Read more
A Mathematical Framework for Transformer Circuits \ Anthropic