Summary of Reinforcement Learning For Infinite-horizon Average-reward Linear Mdps Via Approximation by Discounted-reward Mdps, By Kihyuk Hong et al.
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPsby Kihyuk Hong, Woojin…