Summary of State-novelty Guided Action Persistence in Deep Reinforcement Learning, by Jianshu Hu et al.
State-Novelty Guided Action Persistence in Deep Reinforcement Learningby Jianshu Hu, Paul Weng, Yutong BanFirst submitted…
State-Novelty Guided Action Persistence in Deep Reinforcement Learningby Jianshu Hu, Paul Weng, Yutong BanFirst submitted…
BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shapingby Aly Lidayan, Michael…
Soft Actor-Critic with Beta Policy via Implicit Reparameterization Gradientsby Luca Della LiberaFirst submitted to arxiv…
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churnby Hongyao…
Reward-Directed Score-Based Diffusion Models via q-Learningby Xuefeng Gao, Jiale Zha, Xun Yu ZhouFirst submitted to…
Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functionsby Zakaria MhammediFirst submitted…
LMGT: Optimizing Exploration-Exploitation Balance in Reinforcement Learning through Language Model Guided Trade-offsby Yongxin Deng, Xihe…
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimizationby Minh Vu, Konstantinos SlavakisFirst submitted to arxiv…
RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMsby Jiaxing Wu, Lin Ning,…
AGR: Age Group fairness Reward for Bias Mitigation in LLMsby Shuirong Cao, Ruoxi Cheng, Zhiqiang…