Summary of Ropo: Robust Preference Optimization For Large Language Models, by Xize Liang et al.
ROPO: Robust Preference Optimization for Large Language Modelsby Xize Liang, Chao Chen, Shuang Qiu, Jie…
ROPO: Robust Preference Optimization for Large Language Modelsby Xize Liang, Chao Chen, Shuang Qiu, Jie…
Distributionally Robust Alignment for Medical Federated Vision-Language Pre-training Under Data Heterogeneityby Zitao Shuai, Chenwei Wu,…
Enhancing Breast Cancer Diagnosis in Mammography: Evaluation and Integration of Convolutional Neural Networks and Explainable…
Personalized Federated Learning for Spatio-Temporal Forecasting: A Dual Semantic Alignment-Based Contrastive Approachby Qingxiang Liu, Sheng…
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Modelsby Sean Farhat,…
GreedLlama: Performance of Financial Value-Aligned Large Language Models in Moral Reasoningby Jeffy Yu, Maximilian Huber,…
On the Scalability of Diffusion-based Text-to-Image Generationby Hao Li, Yang Zou, Ying Wang, Orchid Majumder,…
Advancing LLM Reasoning Generalists with Preference Treesby Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding,…
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Modelsby Kyuyoung Kim, Jongheon Jeong, Minyong An, Mohammad Ghavamzadeh,…
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learningby Mengfei Du, Binhao Wu, Jiwen…