Summary of Optimistic Critic Reconstruction and Constrained Fine-tuning For General Offline-to-online Rl, by Qin-wen Luo et al.
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RLby Qin-Wen Luo, Ming-Kun Xie, Ye-Wen…