Summary of Pre-trained Large Language Models Use Fourier Features to Compute Addition, by Tianyi Zhou et al.
Pre-trained Large Language Models Use Fourier Features to Compute Additionby Tianyi Zhou, Deqing Fu, Vatsal…
Pre-trained Large Language Models Use Fourier Features to Compute Additionby Tianyi Zhou, Deqing Fu, Vatsal…
Block Transformer: Global-to-Local Language Modeling for Fast Inferenceby Namgyu Ho, Sangmin Bae, Taehyeon Kim, Hyunjik…
Aligning Large Language Models via Fine-grained Supervisionby Dehong Xu, Liang Qiu, Minseok Kim, Faisal Ladhak,…
Contextual Counting: A Mechanistic Study of Transformers on a Quantitative Taskby Siavash Golkar, Alberto Bietti,…
Loki: Low-rank Keys for Efficient Sparse Attentionby Prajwal Singhania, Siddharth Singh, Shwai He, Soheil Feizi,…
Parrot: Multilingual Visual Instruction Tuningby Hai-Long Sun, Da-Wei Zhou, Yang Li, Shiyin Lu, Chao Yi,…
FFNet: MetaMixer-based Efficient Convolutional Mixer Designby Seokju Yun, Dongheon Lee, Youngmin RoFirst submitted to arxiv…
Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIPby Sriram Balasubramanian, Samyadeep Basu,…
TimeCMA: Towards LLM-Empowered Multivariate Time Series Forecasting via Cross-Modality Alignmentby Chenxi Liu, Qianxiong Xu, Hao…
Understanding Token Probability Encoding in Output Embeddingsby Hakaze Cho, Yoshihiro Sakai, Kenshiro Tanaka, Mariko Kato,…