Summary of The Narrow Gate: Localized Image-text Communication in Vision-language Models, by Alessandro Serra et al.
The Narrow Gate: Localized Image-Text Communication in Vision-Language Modelsby Alessandro Serra, Francesco Ortu, Emanuele Panizon,…
The Narrow Gate: Localized Image-Text Communication in Vision-Language Modelsby Alessandro Serra, Francesco Ortu, Emanuele Panizon,…
I Don’t Know: Explicit Modeling of Uncertainty with an [IDK] Tokenby Roi Cohen, Konstantin Dobler,…
SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervisionby Kangjie Zheng, Siyue Liang, Junwei Yang, Bin…
Transformers Can Navigate Mazes With Multi-Step Predictionby Niklas Nolte, Ouail Kitouni, Adina Williams, Mike Rabbat,…
Rethinking Time Series Forecasting with LLMs via Nearest Neighbor Contrastive Learningby Jayanie Bogahawatte, Sachith Seneviratne,…
VisionZip: Longer is Better but Not Necessary in Vision Language Modelsby Senqiao Yang, Yukang Chen,…
Understanding Hidden Computations in Chain-of-Thought Reasoningby Aryasomayajula Ram BharadwajFirst submitted to arxiv on: 5 Dec…
BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batchingby Zhen…
T-REG: Preference Optimization with Token-Level Reward Regularizationby Wenxuan Zhou, Shujian Zhang, Lingxiao Zhao, Tao MengFirst…
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Modelsby Zeyi Sun, Ziyang…