Summary of Learning Diverse Policies with Soft Self-generated Guidance, by Guojian Wang et al.
Learning Diverse Policies with Soft Self-Generated Guidanceby Guojian Wang, Faguo Wu, Xiao Zhang, Jianxiang LiuFirst…
Learning Diverse Policies with Soft Self-Generated Guidanceby Guojian Wang, Faguo Wu, Xiao Zhang, Jianxiang LiuFirst…
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequencesby Chen Wang, Sarah Erfani, Tansu Alpcan,…
A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPsby Kihyuk Hong, Ambuj TewariFirst…
Read to Play (R2-Play): Decision Transformer with Multimodal Game Instructionby Yonggang Jin, Ge Zhang, Hao…
Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptionsby Daniel Bogdoll, Jing Qin, Moritz Nekolla, Ahmed…
Reinforcement Learning with Ensemble Model Predictive Safety Certificationby Sven Gronauer, Tom Haider, Felippe Schmoeller da…
MusicRL: Aligning Music Generation to Human Preferencesby Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent,…
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learningby Ruoqi Zhang, Ziwei Luo, Jens Sjölund,…
Return-Aligned Decision Transformerby Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-SerraFirst submitted to…
In-context learning agents are asymmetric belief updatersby Johannes A. Schubert, Akshay K. Jagadish, Marcel Binz,…