Summary of Online Policy Distillation with Decision-attention, by Xinqiang Yu et al.
Online Policy Distillation with Decision-Attentionby Xinqiang Yu, Chuanguang Yang, Chengqing Yu, Libo Huang, Zhulin An,…
Online Policy Distillation with Decision-Attentionby Xinqiang Yu, Chuanguang Yang, Chengqing Yu, Libo Huang, Zhulin An,…
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Managementby Huiling Meng, Ningyuan…
Massively Multiagent Minigames for Training Generalist Agentsby Kyoung Whan Choe, Ryan Sullivan, Joseph SuárezFirst submitted…
Optimizing Automatic Differentiation with Deep Reinforcement Learningby Jamie Lohoff, Emre NeftciFirst submitted to arxiv on:…
Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learningby Xuehui Yu, Mhairi Dunion, Xin Li,…
Primitive Agentic First-Order Optimizationby R. SalaFirst submitted to arxiv on: 7 Jun 2024CategoriesMain: Machine Learning…
Stabilizing Extreme Q-learning by Maclaurin Expansionby Motoki Omura, Takayuki Osa, Yusuke Mukuta, Tatsuya HaradaFirst submitted…
On Minimizing Adversarial Counterfactual Error in Adversarial RLby Roman Belaire, Arunesh Sinha, Pradeep VarakanthamFirst submitted…
Reinforcement Learning and Regret Bounds for Admission Controlby Lucas Weber, Ana Bušić, Jiamin ZhuFirst submitted…
Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learningby…