Summary of Kalmamba: Towards Efficient Probabilistic State Space Models For Rl Under Uncertainty, by Philipp Becker et al.
KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertaintyby Philipp Becker, Niklas Freymuth,…
KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertaintyby Philipp Becker, Niklas Freymuth,…
Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learningby Sattar VakiliFirst submitted to arxiv…
An Idiosyncrasy of Time-discretization in Reinforcement Learningby Kris De Asis, Richard S. SuttonFirst submitted to…
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learningby Matthias Weissenbacher, Rishabh Agarwal, Yoshinobu KawaharaFirst submitted…
Behaviour Distillationby Andrei Lupu, Chris Lu, Jarek Liesen, Robert Tjarko Lange, Jakob FoersterFirst submitted to…
Direct Multi-Turn Preference Optimization for Language Agentsby Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang,…
A Benchmark Study of Deep-RL Methods for Maximum Coverage Problems over Graphsby Zhicheng Liang, Yu…
A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithmsby Weiqin Chen, Mark S. Squillante,…
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Foldby Amrith…
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Tradingby Chuqiao Zong, Chaojie Wang, Molei…