Summary of From Algorithm to Hardware: a Survey on Efficient and Safe Deployment Of Deep Neural Networks, by Xue Geng et al.
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networksby…
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networksby…
FedGreen: Carbon-aware Federated Learning with Model Size Adaptationby Ali Abbasi, Fan Dong, Xin Wang, Henry…
Bayesian Federated Model Compression for Communication and Computation Efficiencyby Chengyu Xia, Danny H. K. Tsang,…
Enhancing Inference Efficiency of Large Language Models: Investigating Optimization Strategies and Architectural Innovationsby Georgy TyukinFirst…
Improve Knowledge Distillation via Label Revision and Data Selectionby Weichao Lan, Yiu-ming Cheung, Qing Xu,…
Instance-Aware Group Quantization for Vision Transformersby Jaehyeon Moon, Dohyung Kim, Junyong Cheon, Bumsub HamFirst submitted…
Are Compressed Language Models Less Subgroup Robust?by Leonidas Gee, Andrea Zugarini, Novi QuadriantoFirst submitted to…
Advancing IIoT with Over-the-Air Federated Learning: The Role of Iterative Magnitude Pruningby Fazal Muhammad Ali…
DiPaCo: Distributed Path Compositionby Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Adhiguna Kuncoro, Yani Donchev,…
Adversarial Fine-tuning of Compressed Neural Networks for Joint Improvement of Robustness and Efficiencyby Hallgrimur Thorsteinsson,…