Summary of Critiprefill: a Segment-wise Criticality-based Approach For Prefilling Acceleration in Llms, by Junlin Lv et al.
CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMsby Junlin Lv, Yuan Feng, Xike…
CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMsby Junlin Lv, Yuan Feng, Xike…
Unsupervised Domain Adaptation Via Data Pruningby Andrea Napoli, Paul WhiteFirst submitted to arxiv on: 18…
Multi-Grid Graph Neural Networks with Self-Attention for Computational Mechanicsby Paul Garnier, Jonathan Viquerat, Elie HachemFirst…
CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenariosby Luning Wang, Shiyao Li, Xuefei…
Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policiesby Dennis Gross, Helge SpiekerFirst submitted to arxiv…
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruningby Tianyi Chen,…
An Efficient Privacy-aware Split Learning Framework for Satellite Communicationsby Jianfei Sun, Cong Wu, Shahid Mumtaz,…
Adaptive Error-Bounded Hierarchical Matrices for Efficient Neural Network Compressionby John Mango, Ronald KatendeFirst submitted to…
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruningby Jaeseong Lee, seung-won hwang, Aurick Qiao, Daniel F…
LAMP: Learnable Meta-Path Guided Adversarial Contrastive Learning for Heterogeneous Graphsby Siqing Li, Jin-Duk Park, Wei…