Summary of Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation, by Shangding Gu et al.
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation by Shangding…