Summary of Hierarchical Universal Value Function Approximators, by Rushiv Arora
Hierarchical Universal Value Function Approximatorsby Rushiv AroraFirst submitted to arxiv on: 11 Oct 2024CategoriesMain: Machine…
Hierarchical Universal Value Function Approximatorsby Rushiv AroraFirst submitted to arxiv on: 11 Oct 2024CategoriesMain: Machine…
Can we hop in general? A discussion of benchmark selection and design using the Hopper…
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RLby Claas A Voelcker, Marcel Hussing, Eric Eaton,…
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficientby Wenlong Wang, Ivana Dusparic, Yucheng…
SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning from Pixelsby Malte Mosbach, Jan…
Words as Beacons: Guiding RL Agents with High-Level Language Promptsby Unai Ruiz-Gonzalez, Alain Andres, Pedro…
Towards Sharper Risk Bounds for Minimax Problemsby Bowei Zhu, Shaojie Li, Yong LiuFirst submitted to…
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learningby Xinran Li, Ling Pan, Jun ZhangFirst submitted…
Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamicsby Josiah C. Kratz, Jacob AdamczykFirst submitted…
Exploring Natural Language-Based Strategies for Efficient Number Learning in Children through Reinforcement Learningby Tirthankar MittraFirst…