Summary of "Improving Steering Vectors by Targeting Sparse Autoencoder Features" by Sviatoslav Chalnev, Matthew Siu, and Arthur Conmy
Improving Steering Vectors by Targeting Sparse Autoencoder Features, by Sviatoslav Chalnev, Matthew Siu, Arthur Conmy. First submitted…