Summary of Sable: a Performant, Efficient and Scalable Sequence Model For Marl, by Omayma Mahjoub et al.
Sable: a Performant, Efficient and Scalable Sequence Model for MARLby Omayma Mahjoub, Sasha Abramowitz, Ruan…
Sable: a Performant, Efficient and Scalable Sequence Model for MARLby Omayma Mahjoub, Sasha Abramowitz, Ruan…
PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through Pre-trained Network Distillationby Mohammadamin Davoodabadi, Negin Hashemi…
Stable Offline Value Function Learning with Bisimulation-based Representationsby Brahma S. Pavse, Yudong Chen, Qiaomin Xie,…
Sampling from Energy-based Policies using Diffusionby Vineet Jain, Tara Akhound-Sadegh, Siamak RavanbakhshFirst submitted to arxiv…
Scalable Reinforcement Learning-based Neural Architecture Searchby Amber Cassimon, Siegfried Mercelis, Kevin MetsFirst submitted to arxiv…
Adaptive teachers for amortized samplersby Minsu Kim, Sanghyeok Choi, Taeyoung Yun, Emmanuel Bengio, Leo Feng,…
Absolute State-wise Constrained Policy Optimization: High-Probability State-wise Constraints Satisfactionby Weiye Zhao, Feihan Li, Yifan Sun,…
Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Modelsby Can Demircan, Tankred Saanum, Akshay…
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rankby Wenhao Zhan, Scott…
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretrainingby Jie Cheng, Ruixi Qiao, Yingwei Ma,…