Summary of Curriculum Learning with Quality-driven Data Selection, by Biao Wu et al.
Curriculum Learning with Quality-Driven Data Selection by Biao Wu, Fang Meng, Ling Chen. First submitted to arxiv…
Training-Free Exponential Context Extension via Cascading KV Cache by Jeffrey Willette, Heejun Lee, Youngwan Lee, Myeongjae…
Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient by Yuan Gao, Zujing…
Markov Constraint as Large Language Model Surrogate by Alexandre Bonlarron, Jean-Charles Régin. First submitted to arxiv on:…
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models by Xiang Meng, Kayhan…
When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models by…
SwitchLoRA: Switched Low-Rank Adaptation Can Learn Full-Rank Information by Kaiye Zhou, Shucheng Wang, Jun Xu. First submitted…
Parallelizing Linear Transformers with the Delta Rule over Sequence Length by Songlin Yang, Bailin Wang, Yu…
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization by Haoran You, Yipin Guo, Yichao Fu, Wei…
Block Transformer: Global-to-Local Language Modeling for Fast Inference by Namgyu Ho, Sangmin Bae, Taehyeon Kim, Hyunjik…