Summary of Reinforcement Learning From Human Feedback with Active Queries, by Kaixuan Ji and Jiafan He and Quanquan Gu
Reinforcement Learning from Human Feedback with Active Queriesby Kaixuan Ji, Jiafan He, Quanquan GuFirst submitted…
Reinforcement Learning from Human Feedback with Active Queriesby Kaixuan Ji, Jiafan He, Quanquan GuFirst submitted…
Optimal Thresholding Linear Banditby Eduardo Ochoa Rivera, Ambuj TewariFirst submitted to arxiv on: 11 Feb…
Fourier Circuits in Neural Networks and Transformers: A Case Study of Modular Arithmetic with Multiple…
Less is More: Fewer Interpretable Region via Submodular Subset Selectionby Ruoyu Chen, Hua Zhang, Siyuan…
Unifying Invariance and Spuriousity for Graph Out-of-Distribution via Probability of Necessity and Sufficiencyby Xuexin Chen,…
Deinterleaving of Discrete Renewal Process Mixtures with Application to Electronic Support Measuresby Jean Pinsolle, Olivier…
Optimal and Efficient Algorithms for Decentralized Online Convex Optimizationby Yuanyu Wan, Tong Wei, Bo Xue,…
Evolving Restricted Boltzmann Machine-Kohonen Network for Online Clusteringby J. Senthilnath, Adithya Bhattiprolu, Ankur Singh, Bangjian…
Leveraging the Context through Multi-Round Interactions for Jailbreaking Attacksby Yixin Cheng, Markos Georgopoulos, Volkan Cevher,…
Better-than-KL PAC-Bayes Boundsby Ilja Kuzborskij, Kwang-Sung Jun, Yulian Wu, Kyoungseok Jang, Francesco OrabonaFirst submitted to…