Summary of Meta-reinforcement Learning with Universal Policy Adaptation: Provable Near-optimality Under All-task Optimum Comparator, by Siyuan Xu and Minghui Zhu
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparatorby Siyuan Xu, Minghui…