Summary of Striking a Balance in Fairness For Dynamic Systems Through Reinforcement Learning, by Yaowei Hu et al.
Striking a Balance in Fairness for Dynamic Systems Through Reinforcement Learningby Yaowei Hu, Jacob Lear,…
Striking a Balance in Fairness for Dynamic Systems Through Reinforcement Learningby Yaowei Hu, Jacob Lear,…
Personalized Reinforcement Learning with a Budget of Policiesby Dmitry Ivanov, Omer Ben-PoratFirst submitted to arxiv…
Secrets of RLHF in Large Language Models Part II: Reward Modelingby Binghai Wang, Rui Zheng,…
An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC controlby Antonio Manjavacas, Alejandro Campoy-Nieves,…
Bounds on the price of feedback for mistake-bounded online learningby Jesse Geneson, Linus TangFirst submitted…
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agentsby Quentin Delfosse, Sebastian Sztwiertnia, Mark Rothermel, Wolfgang…
Optimistic Model Rollouts for Pessimistic Offline Policy Optimizationby Yuanzhao Zhai, Yiying Li, Zijian Gao, Xudong…
Innate-Values-driven Reinforcement Learning for Cooperative Multi-Agent Systemsby Qin YangFirst submitted to arxiv on: 10 Jan…
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewardsby Xi Chen, Zhihui Zhu,…
ReACT: Reinforcement Learning for Controller Parametrization using B-Spline Geometriesby Thomas Rudolf, Daniel Flögel, Tobias Schürmann,…