Summary of Adaptive Pruning For Large Language Models with Structural Importance Awareness, by Haotian Zheng et al.
Adaptive Pruning for Large Language Models with Structural Importance Awarenessby Haotian Zheng, Jinke Ren, Yushan…
Adaptive Pruning for Large Language Models with Structural Importance Awarenessby Haotian Zheng, Jinke Ren, Yushan…
Holistic Adversarially Robust Pruningby Qi Zhao, Christian WressneggerFirst submitted to arxiv on: 19 Dec 2024CategoriesMain:…
A Comparative Study of Pruning Methods in Transformer-based Time Series Forecastingby Nicholas Kiefer, Arvid Weyrauch,…
Numerical Pruning for Efficient Autoregressive Modelsby Xuan Shen, Zhao Song, Yufa Zhou, Bo Chen, Jing…
Krony-PT: GPT2 compressed with Kronecker Productsby M. Ayoub Ben Ayad, Jelena Mitrovic, Michael GranitzerFirst submitted…
Scalable Temporal Anomaly Causality Discovery in Large Systems: Achieving Computational Efficiency with Binary Anomaly Flag…
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Modelsby Changhai Zhou, Yuhua Zhou,…
Fast Track to Winning Tickets: Repowering One-Shot Pruning for Graph Neural Networksby Yanwei Yue, Guibin…
Score-matching-based Structure Learning for Temporal Data on Networksby Hao Chen, Kai Yi, Lin Liu, Yu…
Post-Training Statistical Calibration for Higher Activation Sparsityby Vui Seng Chua, Yujie Pan, Nilesh JainFirst submitted…