Summary of Double Successive Over-relaxation Q-learning with An Extension to Deep Reinforcement Learning, by Shreyas S R
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learningby Shreyas S RFirst submitted…
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learningby Shreyas S RFirst submitted…
Simplex-enabled Safe Continual Learning Machineby Hongpeng Cao, Yanbing Mao, Yihao Cai, Lui Sha, Marco CaccamoFirst…
State-Novelty Guided Action Persistence in Deep Reinforcement Learningby Jianshu Hu, Paul Weng, Yutong BanFirst submitted…
BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shapingby Aly Lidayan, Michael…
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churnby Hongyao…
Reward-Directed Score-Based Diffusion Models via q-Learningby Xuefeng Gao, Jiale Zha, Xun Yu ZhouFirst submitted to…
Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functionsby Zakaria MhammediFirst submitted…
Soft Actor-Critic with Beta Policy via Implicit Reparameterization Gradientsby Luca Della LiberaFirst submitted to arxiv…
LMGT: Optimizing Exploration-Exploitation Balance in Reinforcement Learning through Language Model Guided Trade-offsby Yongxin Deng, Xihe…
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimizationby Minh Vu, Konstantinos SlavakisFirst submitted to arxiv…