Summary of Boosting Long-context Management Via Query-guided Activation Refilling, by Hongjin Qian et al.
Boosting Long-Context Management via Query-Guided Activation Refillingby Hongjin Qian, Zheng Liu, Peitian Zhang, Zhicheng Dou,…
Boosting Long-Context Management via Query-Guided Activation Refillingby Hongjin Qian, Zheng Liu, Peitian Zhang, Zhicheng Dou,…
Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-trainingby Mingjia Shi, Yuhao Zhou,…
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Reasoningby Hongbin Zhang, Kehai Chen,…
Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Modelsby Sina Bagheri Nezhad, Ameeta…
Solid-SQL: Enhanced Schema-linking based In-context Learning for Robust Text-to-SQLby Geling Liu, Yunzhi Tan, Ruichao Zhong,…
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamicsby Ruixin Mao,…
Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM…
A Scalable Approach to Benchmarking the In-Conversation Differential Diagnostic Accuracy of a Health AIby Deep…
LLMCL-GEC: Advancing Grammatical Error Correction with LLM-Driven Curriculum Learningby Tao Fang, Derek F. Wong, Lusheng…
SAModified: A Foundation Model-Based Zero-Shot Approach for Refining Noisy Land-Use Land-Cover Mapsby Sparsh Pekhale, Rakshith…