Summary of State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards, by Yuto Tanimoto et al.
State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards, by Yuto Tanimoto, Kenji Fukumizu. First submitted…