Summary of Cream: Consistency Regularized Self-rewarding Language Models, by Zhaoyang Wang et al.
CREAM: Consistency Regularized Self-Rewarding Language Modelsby Zhaoyang Wang, Weilei He, Zhiyuan Liang, Xuchao Zhang, Chetan…
CREAM: Consistency Regularized Self-Rewarding Language Modelsby Zhaoyang Wang, Weilei He, Zhiyuan Liang, Xuchao Zhang, Chetan…
Revisited Large Language Model for Time Series Analysis through Modality Alignmentby Liangwei Nathan Zheng, Chang…
DAQ: Density-Aware Post-Training Weight-Only Quantization For LLMsby Yingsong Luo, Ling ChenFirst submitted to arxiv on:…
Preference Optimization with Multi-Sample Comparisonsby Chaoqi Wang, Zhuokai Zhao, Chen Zhu, Karthik Abinav Sankararaman, Michal…
Improving Long-Text Alignment for Text-to-Image Diffusion Modelsby Luping Liu, Chao Du, Tianyu Pang, Zehan Wang,…
Understanding Likelihood Over-optimisation in Direct Alignment Algorithmsby Zhengyan Shi, Sander Land, Acyr Locatelli, Matthieu Geist,…
Data Quality Control in Federated Instruction-tuning of Large Language Modelsby Yaxin Du, Rui Ye, Fengting…
TSDS: Data Selection for Task-Specific Model Finetuningby Zifan Liu, Amin Karbasi, Theodoros RekatsinasFirst submitted to…
FedCCRL: Federated Domain Generalization with Cross-Client Representation Learningby Xinpeng Wang, Yongxin Guo, Xiaoying TangFirst submitted…
Tackling Dimensional Collapse toward Comprehensive Universal Domain Adaptationby Hung-Chieh Fang, Po-Yi Lu, Hsuan-Tien LinFirst submitted…