Summary of Towards Reliable Alignment: Uncertainty-aware RLHF, by Debangshu Banerjee et al.
Towards Reliable Alignment: Uncertainty-aware RLHF by Debangshu Banerjee, Aditya Gopalan. First submitted to arXiv on: 31 Oct…
Multi-fidelity Machine Learning for Uncertainty Quantification and Optimization by Ruda Zhang, Negin Alemazkoor. First submitted to arXiv…
Dynamic Information Sub-Selection for Decision Support by Hung-Tien Huang, Maxwell Lennon, Shreyas Bhat Brahmavar, Sean Sylvia,…
Learning and Transferring Sparse Contextual Bigrams with Linear Transformers by Yunwei Ren, Zixuan Wang, Jason D.…
COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences by Yixin Liu, Argyris Oikonomou, Weiqiang…
Federated Learning under Periodic Client Participation and Heterogeneous Data: A New Communication-Efficient Algorithm and Analysis by…
HiBO: Hierarchical Bayesian Optimization via Adaptive Search Space Partitioning by Wenxuan Li, Taiyi Wang, Eiko Yoneki. First…
Dual-Optimized Adaptive Graph Reconstruction for Multi-View Graph Clustering by Zichen Wen, Tianyi Wu, Yazhou Ren, Yawen…
Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem by Nima Akbarzadeh, Erick Delage, Yossiri Adulyasak. First…
MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning by Xujia Wang, Haiyan Zhao, Shuo…