Summary of Lagrangian Index Policy For Restless Bandits with Average Reward, by Konstantin Avrachenkov et al.
Lagrangian Index Policy for Restless Bandits with Average Rewardby Konstantin Avrachenkov, Vivek S. Borkar, Pratik…
Lagrangian Index Policy for Restless Bandits with Average Rewardby Konstantin Avrachenkov, Vivek S. Borkar, Pratik…
Asynchronous Distributed Gaussian Process Regression for Online Learning and Dynamical Systems: Complementary Documentby Zewen Yang,…
Doubly-Bounded Queue for Constrained Online Learning: Keeping Pace with Dynamics of Both Loss and Constraintby…
Proactive Model Adaptation Against Concept Drift for Online Time Series Forecastingby Lifan Zhao, Yanyan ShenFirst…
Learning from Snapshots of Discrete and Continuous Data Streamsby Pramith Devulapalli, Steve HannekeFirst submitted to…
Accurate Multi-Category Student Performance Forecasting at Early Stages of Online Education Using Neural Networksby Naveed…
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrationsby Cevahir Koprulu, Po-han Li, Tianyu Qiu,…
Combinatorial Rising Banditby Seockbean Song, Youngsik Yoon, Siwei Wang, Wei Chen, Jungseul OkFirst submitted to…
Act Now: A Novel Online Forecasting Framework for Large-Scale Streaming Databy Daojun Liang, Haixia Zhang,…
CAdam: Confidence-Based Optimization for Online Learningby Shaowen Wang, Anan Liu, Jian Xiao, Huan Liu, Yuekui…