Summary of TSO: Self-Training with Scaled Preference Optimization, by Kaihui Chen et al.
TSO: Self-Training with Scaled Preference Optimization by Kaihui Chen, Hao Yi, Qingyang Li, Tianyu Qi, Yulan…