Summary of Asymptotics Of Feature Learning in Two-layer Networks After One Gradient-step, by Hugo Cui et al.
Asymptotics of feature learning in two-layer networks after one gradient-stepby Hugo Cui, Luca Pesce, Yatin…
Asymptotics of feature learning in two-layer networks after one gradient-stepby Hugo Cui, Luca Pesce, Yatin…
Strong convexity-guided hyper-parameter optimization for flatter lossesby Rahul Yedida, Snehanshu SahaFirst submitted to arxiv on:…
Tighter Generalisation Bounds via Interpolationby Paul Viallard, Maxime Haddouche, Umut Şimşekli, Benjamin GuedjFirst submitted to…
Explaining Learned Reward Functions with Counterfactual Trajectoriesby Jan Wehner, Frans Oliehoek, Luciano Cavalcante SiebertFirst submitted…
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replayby Natasha Butt, Blazej Manczak, Auke Wiggers, Corrado…
On Provable Length and Compositional Generalizationby Kartik Ahuja, Amin MansouriFirst submitted to arxiv on: 7…
Group Distributionally Robust Dataset Distillation with Risk Minimizationby Saeed Vahidian, Mingyu Wang, Jianyang Gu, Vyacheslav…
LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Viewsby Yuji Roh, Qingyun Liu, Huan Gui,…
Amortized Planning with Large-Scale Transformers: A Case Study on Chessby Anian Ruoss, Grégoire Delétang, Sourabh…
PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networksby Junwei Su, Difan Zou, Chuan WuFirst submitted…