Summary of Cross-domain Policy Adaptation by Capturing Representation Mismatch, By Jiafei Lyu et al.
Cross-Domain Policy Adaptation by Capturing Representation Mismatchby Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu,…
Cross-Domain Policy Adaptation by Capturing Representation Mismatchby Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu,…
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rateby Fan-Ming Luo, Zuolin Tu, Zefang Huang,…
iVideoGPT: Interactive VideoGPTs are Scalable World Modelsby Jialong Wu, Shaofeng Yin, Ningya Feng, Xu He,…
Cooperative Backdoor Attack in Decentralized Reinforcement Learning with Theoretical Guaranteeby Mengtong Gao, Yifei Zou, Zuyuan…
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Modelsby Cong Lu, Shengran Hu, Jeff…
Diffusion Actor-Critic with Entropy Regulatorby Yinuo Wang, Likun Wang, Yuxuan Jiang, Wenjun Zou, Tong Liu,…
Extracting Heuristics from Large Language Models for Reward Shaping in Reinforcement Learningby Siddhant Bhambri, Amrita…
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPsby Kihyuk Hong, Woojin…
MallowsPO: Fine-Tune Your LLM with Preference Dispersionsby Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao,…
Interpretable and Editable Programmatic Tree Policies for Reinforcement Learningby Hector Kohler, Quentin Delfosse, Riad Akrour,…