Summary of Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives, by Anirudhan Badrinath et al.
Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives, by Anirudhan Badrinath, Prabhat Agarwal, Jiajing…