Summary of Learning to Grok: Emergence Of In-context Learning and Skill Composition in Modular Arithmetic Tasks, by Tianyu He et al.
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasksby Tianyu…
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasksby Tianyu…
Pretrained Mobility Transformer: A Foundation Model for Human Mobilityby Xinhua Wu, Haoyu He, Yanchao Wang,…
Landscape-Aware Growing: The Power of a Little LAGby Stefani Karp, Nikunj Saunshi, Sobhan Miryoosefi, Sashank…
A Survey of Transformer Enabled Time Series Synthesisby Alexander Sommers, Logan Cummins, Sudip Mittal, Shahram…
AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fieldsby Louis Serrano, Thomas…
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learningby Jiahang Cao, Qiang…
Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasksby Mahdi Sabbaghi, George…
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Cachingby Xinyin Ma, Gongfan Fang, Michael Bi Mi, Xinchao…
CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Frameworkby Yiyang Zhao, Yunzhuo Liu, Bo Jiang, Tian…
Universal In-Context Approximation By Prompting Fully Recurrent Modelsby Aleksandar Petrov, Tom A. Lamb, Alasdair Paren,…