Summary of Aim: Adaptive Inference Of Multi-modal Llms Via Token Merging and Pruning, by Yiwu Zhong et al.
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruningby Yiwu Zhong, Zhuoming Liu,…
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruningby Yiwu Zhong, Zhuoming Liu,…
Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retrainingby Zongru Wu, Pengzhou Cheng,…
[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Fasterby…
RandAR: Decoder-only Autoregressive Visual Generation in Random Ordersby Ziqi Pang, Tianyuan Zhang, Fujun Luan, Yunze…
LLMs4Life: Large Language Models for Ontology Learning in Life Sciencesby Nadeen Fathallah, Steffen Staab, Alsayed…
PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Controlby Ruichen Wang, Junliang Zhang,…
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignmentby Yan Li, Yifei Xing, Xiangyuan…
LLMs as mirrors of societal moral standards: reflection of cultural divergence and agreement across ethical…
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decodingby Ziyin Zhang,…
Cross-modal Information Flow in Multimodal Large Language Modelsby Zhi Zhang, Srishti Yadav, Fengze Han, Ekaterina…