Summary of InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques, by Rohan Gupta et al.
InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques, by Rohan Gupta, Iván Arcuschin, Thomas Kwa, Adrià…