Summary of Stein Variational Evolution Strategies, by Cornelius V. Braun et al.
Stein Variational Evolution Strategiesby Cornelius V. Braun, Robert T. Lange, Marc ToussaintFirst submitted to arxiv…
Stein Variational Evolution Strategiesby Cornelius V. Braun, Robert T. Lange, Marc ToussaintFirst submitted to arxiv…
Provable Acceleration of Nesterov’s Accelerated Gradient for Rectangular Matrix Factorization and Linear Neural Networksby Zhenghao…
Efficient line search for optimizing Area Under the ROC Curve in gradient descentby Jadon Fowler,…
Towards Sharper Risk Bounds for Minimax Problemsby Bowei Zhu, Shaojie Li, Yong LiuFirst submitted to…
Simultaneous Weight and Architecture Optimization for Neural Networksby Zitong Huang, Mansooreh Montazerin, Ajitesh SrivastavaFirst submitted…
Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?by Khashayar Gatmiry, Nikunj…
Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptationby Grigory Malinovsky,…
On the Convergence of (Stochastic) Gradient Descent for Kolmogorov–Arnold Networksby Yihang Gao, Vincent Y. F.…
On Barycenter Computation: Semi-Unbalanced Optimal Transport-based Method on Gaussiansby Ngoc-Hai Nguyen, Dung Le, Hoang-Phi Nguyen,…
Neural Reasoning Networks: Efficient Interpretable Neural Networks With Automatic Textual Explanationsby Stephen Carrow, Kyle Harper…