Summary of Statistical Properties of Robust Satisficing, by Zhiyi Li et al.
Statistical Properties of Robust Satisficing by Zhiyi Li, Yunbei Xu, Ruohan Zhan. First submitted to arXiv on:…
Performance of NPG in Countable State-Space Average-Cost RL by Yashaswini Murthy, Isaac Grosof, Siva Theja Maguluri,…
Group Robust Preference Optimization in Reward-free RLHF by Shyam Sundhar Ramesh, Yifan Hu, Iason Chaimalas, Viraj…
Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges by Hari…
Quantitative Convergences of Lie Group Momentum Optimizers by Lingkai Kong, Molei Tao. First submitted to arXiv on:…
XPrompt: Explaining Large Language Model’s Generation via Joint Prompt Attribution by Yurui Chang, Bochuan Cao, Yujia Wang,…
Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback by Sanghyeon…
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads by…
Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization by Xi Lin, Yilu Liu, Xiaoyuan Zhang,…
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models by Masatoshi Uehara, Yulai…