Summary of Emergence Of Meta-stable Clustering in Mean-field Transformer Models, by Giuseppe Bruno et al.
Emergence of meta-stable clustering in mean-field transformer modelsby Giuseppe Bruno, Federico Pasqualotto, Andrea AgazziFirst submitted…
Emergence of meta-stable clustering in mean-field transformer modelsby Giuseppe Bruno, Federico Pasqualotto, Andrea AgazziFirst submitted…
Provably Optimal Memory Capacity for Modern Hopfield Models: Transformer-Compatible Dense Associative Memories as Spherical Codesby…
Higher-order Cross-structural Embedding Model for Time Series Analysisby Guancen Lin, Cong Shen, Aijing LinFirst submitted…
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptationby Samuele Peri, Alessio Russo, Gabor…
Toward Understanding In-context vs. In-weight Learningby Bryan Chan, Xinyi Chen, András György, Dale SchuurmansFirst submitted to…
WaveRoRA: Wavelet Rotary Route Attention for Multivariate Time Series Forecastingby Aobo Liang, Yan Sun, Nadra…
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasksby Thomas Schmied, Thomas…
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencodersby Viacheslav Surkov, Chris Wendler, Mikhail Terekhov,…
Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speechby Eric Battenberg, RJ Skerry-Ryan, Daisy Stanton,…
Evaluating K-Fold Cross Validation for Transformer Based Symbolic Regression Modelsby Kaustubh Kislay, Shlok Singh, Soham…