Summary of Verification-guided Shielding For Deep Reinforcement Learning, by Davide Corsi et al.
Verification-Guided Shielding for Deep Reinforcement Learningby Davide Corsi, Guy Amir, Andoni Rodriguez, Cesar Sanchez, Guy…
Verification-Guided Shielding for Deep Reinforcement Learningby Davide Corsi, Guy Amir, Andoni Rodriguez, Cesar Sanchez, Guy…
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?by Denis Tarasov, Kirill Brilliantov,…
Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimizationby Jesse van Remmerden, Maurice Kenter, Diederik…
Decoupling regularization from the action spaceby Sobhan Mohammadpour, Emma Frejinger, Pierre-Luc BaconFirst submitted to arxiv…
Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learningby Takayuki Osa, Tatsuya HaradaFirst…
Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learningby Donghu Kim, Hojoon Lee, Kyungmin Lee,…
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Modelsby Shreyas Basavatia, Keerthiram…
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learningby Utsav Singh, Pramit Bhattacharyya, Vinay…
ICU-Sepsis: A Benchmark MDP Built from Real Medical Databy Kartik Choudhary, Dhawal Gupta, Philip S.…
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RLby Qi Lv,…