Summary of Infinite-horizon Reinforcement Learning with Multinomial Logistic Function Approximation, by Jaehyun Park et al.
Infinite-Horizon Reinforcement Learning with Multinomial Logistic Function Approximationby Jaehyun Park, Junyeop Kwon, Dabeen LeeFirst submitted…