Summary of Beware Of Calibration Data For Pruning Large Language Models, by Yixin Ji et al.
Beware of Calibration Data for Pruning Large Language Modelsby Yixin Ji, Yang Xiang, Juntao Li,…
Beware of Calibration Data for Pruning Large Language Modelsby Yixin Ji, Yang Xiang, Juntao Li,…
Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importanceby Mostafa Hussien, Mahmoud…
TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Samplingby Jiahao Qiu, Yifu Lu, Yifan…
Pruning Foundation Models for High Accuracy without Retrainingby Pu Zhao, Fei Sun, Xuan Shen, Pinrui…
DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learningby Ketin Yin, Zonghao Guo, ZhengHan QinFirst…
Adaptive Pruning with Module Robustness Sensitivity: Balancing Compression and Robustnessby Lincen Bai, Hedi Tabia, Raúl…
FedSpaLLM: Federated Pruning of Large Language Modelsby Guangji Bai, Yijiang Li, Zilinghan Li, Liang Zhao,…
SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Modelsby Yuqi Li,…
Large Language Models Are Overparameterized Text Encodersby Thennal D K, Tim Fischer, Chris BiemannFirst submitted…
EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Searchby Oliver Sieberling, Denis Kuznedelev, Eldar Kurtic,…