Summary of Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation, by Fengshuo Bai et al.
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation by Fengshuo Bai, Rui Zhao, Hongming Zhang, Sijia…