Summary of SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF, by Atoosa Chegini et al.
SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF by Atoosa Chegini, Hamid Kazemi, Iman Mirzadeh,…
Formal Theorem Proving by Rewarding LLMs to Decompose Proofs Hierarchically by Kefan Dong, Arvind Mahankali, Tengyu…
Decision Trees for Interpretable Clusters in Mixture Models and Deep Representations by Maximilian Fleissner, Maedeh Zarvandi,…
PageRank Bandits for Link Prediction by Yikun Ban, Jiaru Zou, Zihao Li, Yunzhe Qi, Dongqi Fu,…
Supervised Score-Based Modeling by Gradient Boosting by Changyuan Zhao, Hongyang Du, Guangyuan Liu, Dusit Niyato. First submitted…
PedSleepMAE: Generative Model for Multimodal Pediatric Sleep Signals by Saurav R. Pandey, Aaqib Saeed, Harlin Lee. First…
A Multi-Granularity Supervised Contrastive Framework for Remaining Useful Life Prediction of Aero-engines by Zixuan He, Ziqian…
SelfCodeAlign: Self-Alignment for Code Generation by Yuxiang Wei, Federico Cassano, Jiawei Liu, Yifeng Ding, Naman Jain,…
In-Context Fine-Tuning for Time-Series Foundation Models by Abhimanyu Das, Matthew Faw, Rajat Sen, Yichen Zhou. First submitted…
Keep on Swimming: Real Attackers Only Need Partial Knowledge of a Multi-Model System by Julian Collado,…