Summary of Dictionary Learning Improves Patch-free Circuit Discovery in Mechanistic Interpretability: a Case Study on Othello-gpt, by Zhengfu He et al.
Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPTby Zhengfu…