Summary of Model-based Policy Optimization Using Symbolic World Model, by Andrey Gorodetskiy et al.
Model-based Policy Optimization using Symbolic World Model by Andrey Gorodetskiy, Konstantin Mironov, Aleksandr Panov. First submitted to…
Analyzing and Bridging the Gap between Maximizing Total Reward and Discounted Reward in Deep Reinforcement…
Geometric Active Exploration in Markov Decision Processes: the Benefit of Abstraction by Riccardo De Santi, Federico…
Reconfigurable Intelligent Surface Aided Vehicular Edge Computing: Joint Phase-shift Optimization and Multi-User Power Allocation by Kangwei…
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods by WooJae Jeon, KangJun Lee, Jeewoo Lee. First…
Data-Driven Estimation of Conditional Expectations, Application to Optimal Stopping and Reinforcement Learning by George V. Moustakides. First…
Maintenance Strategies for Sewer Pipes with Multi-State Degradation and Deep Reinforcement Learning by Lisandro A. Jimenez-Roa,…
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning by Minjae Cho, Chuangchuang Sun. First submitted to arxiv…
Variable-Agnostic Causal Exploration for Reinforcement Learning by Minh Hoang Nguyen, Hung Le, Svetha Venkatesh. First submitted to…
Estimating Reaction Barriers with Deep Reinforcement Learning by Adittya Pal. First submitted to arxiv on: 17 Jul…