Summary of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment, by Xin Xiao et al.
Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment by Xin Xiao, Bohong Wu, Jiacong Wang,…
Yuan 2.0-M32: Mixture of Experts with Attention Router by Shaohua Wu, Jiangang Luo, Xi Chen, Lingjun…
LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design by Rui Kong, Qiyang Li,…
Don’t Miss the Forest for the Trees: Attentional Vision Calibration for Large Vision Language Models by…
TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction by Yinda Chen, Haoyuan Shi, Xiaoyu Liu,…
Empowering Character-level Text Infilling by Eliminating Sub-Tokens by Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Hongsheng Li. First…
SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation by Ziqin Luo, Haixia Han, Haokun…
Less is more: Summarizing Patch Tokens for efficient Multi-Label Class-Incremental Learning by Thomas De Min, Massimiliano…
GECKO: Generative Language Model for English, Code and Korean by Sungwoo Oh, Donggyu Kim. First submitted to…
Let’s Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text…