Summary of Preference Learning Algorithms Do Not Learn Preference Rankings, by Angelica Chen et al.
Preference Learning Algorithms Do Not Learn Preference Rankingsby Angelica Chen, Sadhika Malladi, Lily H. Zhang,…
Preference Learning Algorithms Do Not Learn Preference Rankingsby Angelica Chen, Sadhika Malladi, Lily H. Zhang,…
CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learningby Yiping Wang, Yifang Chen, Wendan…
Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Groundingby Shenghuan Sun, Alexander Schubert, Gregory M. Goldgof,…
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHFby Shicong Cen, Jincheng Mei,…
Self-Exploring Language Models: Active Preference Elicitation for Online Alignmentby Shenao Zhang, Donghan Yu, Hiteshi Sharma,…
X-VILA: Cross-Modality Alignment for Large Language Modelby Hanrong Ye, De-An Huang, Yao Lu, Zhiding Yu,…
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Modelsby Zhanhui Zhou, Zhixuan…
Inference-Time Alignment of Diffusion Models with Direct Noise Optimizationby Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang,…
Lisa: Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning Attackby Tiansheng Huang, Sihao…
It’s Not a Modality Gap: Characterizing and Addressing the Contrastive Gapby Abrar Fahim, Alex Murphy,…