Summary of Understanding Warmup-stable-decay Learning Rates: a River Valley Loss Landscape Perspective, by Kaiyue Wen et al.
Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape Perspectiveby Kaiyue Wen, Zhiyuan Li, Jason…
Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape Perspectiveby Kaiyue Wen, Zhiyuan Li, Jason…
SePPO: Semi-Policy Preference Optimization for Diffusion Alignmentby Daoan Zhang, Guangchen Lan, Dong-Jun Han, Wenlin Yao,…
SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masksby Fenia Christopoulou, Ronald Cardenas, Gerasimos…
Tuning-Free Bilevel Optimization: New Algorithms and Convergence Analysisby Yifan Yang, Hao Ban, Minhui Huang, Shiqian…
A Simulation-Free Deep Learning Approach to Stochastic Optimal Controlby Mengjian Hua, Matthieu Laurière, Eric Vanden-EijndenFirst…
Taming Gradient Oversmoothing and Expansion in Graph Neural Networksby MoonJeong Park, Dongwoo KimFirst submitted to…
On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descentby Bingrui Li, Wei…
ImProver: Agent-Based Automated Proof Optimizationby Riyaz Ahuja, Jeremy Avigad, Prasad Tetali, Sean WelleckFirst submitted to…
Fast Training of Sinusoidal Neural Fields via Scaling Initializationby Taesun Yeom, Sangyoon Lee, Jaeho LeeFirst…
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHFby Zhaolin Gao, Wenhao Zhan, Jonathan…