Summary of Adversarial Representation Engineering: a General Model Editing Framework For Large Language Models, by Yihao Zhang et al.
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Modelsby Yihao Zhang, Zeming…