Summary of Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning, by Yu Fu et al.
Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning, by…
FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation, by Christopher T.H Teo, Milad Abdollahzadeh, Xinda Ma,…
DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations, by Aryo Pradipta Gema, Chen Jin, Ahmed…
Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition, by Zi-Rui Wang. First submitted to…
On Explaining with Attention Matrices, by Omar Naim, Nicholas Asher. First submitted to arXiv on: 24 Oct…
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models, by Ziyu Liu, Yuhang Zang, Xiaoyi…
Emotion Recognition with Facial Attention and Objective Activation Functions, by Andrzej Miskow, Abdulrahman Altahhan. First submitted to…
Order Matters: Exploring Order Sensitivity in Multimodal Large Language Models, by Zhijie Tan, Xu Chu, Weiping…
Zero-Shot Vision-and-Language Navigation with Collision Mitigation in Continuous Environment, by Seongjun Jeong, Gi-Cheon Kang, Joochan Kim,…
PLDR-LLM: Large Language Model from Power Law Decoder Representations, by Burc Gokden. First submitted to arXiv on:…