Summary of Online MDP with Transition Prototypes: A Robust Adaptive Approach, by Shuo Sun et al.
Online MDP with Transition Prototypes: A Robust Adaptive Approach by Shuo Sun, Meng Qi, Zuo-Jun Max…
Alignment faking in large language models by Ryan Greenblatt, Carson Denison, Benjamin Wright, Fabien Roger, Monte…
Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report by Markus Dablander. First submitted…
Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts by Jihye Choi, Jayaram Raghuram, Yixuan Li,…
Machine Learning Co-pilot for Screening of Organic Molecular Additives for Perovskite Solar Cells by Yang Pu,…
Trustworthy Transfer Learning: A Survey by Jun Wu, Jingrui He. First submitted to arXiv on: 18 Dec…
jinns: a JAX Library for Physics-Informed Neural Networks by Hugo Gangloff, Nicolas Jouvin. First submitted to arXiv…
On Calibration in Multi-Distribution Learning by Rajeev Verma, Volker Fischer, Eric Nalisnick. First submitted to arXiv on:…
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective by Zhiyuan…
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation by Runtao Liu, Haoyu Wu, Zheng Ziqiang, Chen Wei,…