Summary of KAN v.s. MLP for Offline Reinforcement Learning, by Haihong Guo et al.
KAN v.s. MLP for Offline Reinforcement Learning by Haihong Guo, Fengxin Li, Jiao Li, Hongyan Liu. First…
Curricula for Learning Robust Policies with Factored State Representations in Changing Environments by Panayiotis Panayiotou, Özgür…
Quantum-inspired Reinforcement Learning for Synthesizable Drug Design by Dannong Wang, Jintai Chen, Zhiding Liang, Tianfan Fu,…
Quasimetric Value Functions with Dense Rewards by Khadichabonu Valieva, Bikramjit Banerjee. First submitted to arXiv on: 13…
Batch Ensemble for Variance Dependent Regret in Stochastic Bandits by Asaf Cassel, Orin Levy, Yishay Mansour. First…
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks by Tianlong Wang, Junzhe Chen,…
Multi-Model based Federated Learning Against Model Poisoning Attack: A Deep Learning Based Model Selection for…
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning by Hanyang Zhao,…
Learning Causally Invariant Reward Functions from Diverse Demonstrations by Ivan Ovinnikov, Eugene Bykovets, Joachim M. Buhmann. First…
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning by Teng Yan, Zhendong Ruan, Yaobang Cai, Yu…