Summary of Closing the Gap: Achieving Global Convergence (last Iterate) Of Actor-critic Under Markovian Sampling with Neural Network Parametrization, by Mudit Gaur and Amrit Singh Bedi and Di Wang and Vaneet Aggarwal
Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural…