Summary of LookupViT: Compressing Visual Information to a Limited Number of Tokens, by Rajat Koner et al.
LookupViT: Compressing visual information to a limited number of tokens by Rajat Koner, Gagan Jain, Prateek…
MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training by Pinxue Zhao, Hailin Zhang, Fangcheng Fu,…
Understanding Transformers via N-gram Statistics by Timothy Nguyen. First submitted to arXiv on: 30 Jun 2024. Categories: Main: Computation…
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation by Branden Butler, Sixing Yu, Arya Mazaheri, Ali…
Counting in Small Transformers: The Delicate Interplay between Attention and Feed-Forward Layers by Freya Behrens, Luca…
MoESD: Mixture of Experts Stable Diffusion to Mitigate Gender Bias by Guorun Wang, Lucia Specia. First submitted…
LLM Circuit Analyses Are Consistent Across Training and Scale by Curt Tigges, Michael Hanna, Qinan Yu,…
Accessing Vision Foundation Models via ImageNet-1K by Yitian Zhang, Xu Ma, Yue Bai, Huan Wang, Yun…
By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting by Hyungjun…
Optimized Multi-Token Joint Decoding with Auxiliary Model for LLM Inference by Zongyue Qin, Ziniu Hu, Zifan…