Summary of "On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability", by Chenyu Zheng et al.
On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability, by Chenyu Zheng, Wei Huang, Rongzhen Wang,…