Summary of Symmetric Reinforcement Learning Loss For Robust Learning on Diverse Tasks and Model Scales, by Ju-seung Byun et al.
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scalesby Ju-Seung Byun,…
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scalesby Ju-Seung Byun,…
Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Databy Yuhao Chen, Zhimu Wang,…
Hierarchical Uncertainty Exploration via Feedforward Posterior Treesby Elias Nehme, Rotem Mulayoff, Tomer MichaeliFirst submitted to…
BiSup: Bidirectional Quantization Error Suppression for Large Language Modelsby Minghui Zou, Ronghui Guo, Sai Zhang,…
LIRE: listwise reward enhancement for preference alignmentby Mingye Zhu, Yi Liu, Lei Zhang, Junbo Guo,…
WisPerMed at BioLaySumm: Adapting Autoregressive Large Language Models for Lay Summarization of Scientific Articlesby Tabea…