Summary of Plad: Preference-based Large Language Model Distillation with Pseudo-preference Pairs, by Rongzhi Zhang et al.
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairsby Rongzhi Zhang, Jiaming Shen, Tianqi Liu,…