Summary of Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models, by Zhuoran Jin et al.