A Mathematical Framework for Transformer Circuits
Interpretability, Research
Dec 22, 2021