Summary of Critical Data Size Of Language Models From a Grokking Perspective, by Xuekai Zhu et al.
Critical Data Size of Language Models from a Grokking Perspectiveby Xuekai Zhu, Yao Fu, Bowen…
Critical Data Size of Language Models from a Grokking Perspectiveby Xuekai Zhu, Yao Fu, Bowen…
AutoFT: Learning an Objective for Robust Fine-Tuningby Caroline Choi, Yoonho Lee, Annie Chen, Allan Zhou,…
Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting…
Transduce: learning transduction grammars for string transformationby Francis Frydman, Philippe MangionFirst submitted to arxiv on:…
A Two-Scale Complexity Measure for Deep Learning Modelsby Massimiliano Datres, Gian Paolo Leonardi, Alessio Figalli,…
Leveraging Gradients for Unsupervised Accuracy Estimation under Distribution Shiftby Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov,…
Rigid Protein-Protein Docking via Equivariant Elliptic-Paraboloid Interface Predictionby Ziyang Yu, Wenbing Huang, Yang LiuFirst submitted…
lpNTK: Better Generalisation with Less Data via Sample Interaction During Learningby Shangmin Guo, Yi Ren,…
The Effect of Intrinsic Dataset Properties on Generalization: Unraveling Learning Differences Between Natural and Medical…
Personalized Federated Learning of Probabilistic Models: A PAC-Bayesian Approachby Mahrokh Ghoddousi Boroujeni, Andreas Krause, Giancarlo…