Summary of "On the Transformations across Reward Model, Parameter Update, and In-Context Prompt" by Deng Cai, Huayang Li, Tingchen Fu, Siheng Li, Weiwen Xu, Shuaiyi Li, Bowen Cao, Zhisong Zhang, Xinting Huang, Leyang Cui, Yan Wang, Lemao Liu, Taro Watanabe, and Shuming Shi
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt, by Deng Cai, Huayang Li,…