Summary of Decentralized Optimization in Time-varying Networks with Arbitrary Delays, by Tomas Ortega et al.
Decentralized Optimization in Time-Varying Networks with Arbitrary Delaysby Tomas Ortega, Hamid JafarkhaniFirst submitted to arxiv…
Decentralized Optimization in Time-Varying Networks with Arbitrary Delaysby Tomas Ortega, Hamid JafarkhaniFirst submitted to arxiv…
Learning to Continually Learn with the Bayesian Principleby Soochan Lee, Hyeonseong Jeon, Jaehyeon Son, Gunhee…
The Unified Balance Theory of Second-Moment Exponential Scaling Optimizers in Visual Tasksby Gongyue Zhang, Honghai…
A Hessian-Aware Stochastic Differential Equation for Modelling SGDby Xiang Li, Zebang Shen, Liang Zhang, Niao…
Adaptive debiased SGD in high-dimensional GLMs with streaming databy Ruijian Han, Lan Luo, Yuanhang Luo,…
Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Trainingby Anchit Jain,…
Understanding Forgetting in Continual Learning with Linear Regressionby Meng Ding, Kaiyi Ji, Di Wang, Jinhui…
Matrix Low-Rank Approximation For Policy Gradient Methodsby Sergio Rozada, Antonio G. MarquesFirst submitted to arxiv…
Clip Body and Tail Separately: High Probability Guarantees for DPSGD with Heavy Tailsby Haichao Sha,…
Dual-Delayed Asynchronous SGD for Arbitrarily Heterogeneous Databy Xiaolu Wang, Yuchang Sun, Hoi-To Wai, Jun ZhangFirst…