Summary of The Solution For the Aigc Inference Performance Optimization Competition, by Sishun Pan et al.
The Solution for the AIGC Inference Performance Optimization Competitionby Sishun Pan, Haonan Xu, Zhonghua Wan,…
The Solution for the AIGC Inference Performance Optimization Competitionby Sishun Pan, Haonan Xu, Zhonghua Wan,…
The Impact of Quantization and Pruning on Deep Reinforcement Learning Modelsby Heng Lu, Mehdi Alemi,…
Isomorphic Pruning for Vision Modelsby Gongfan Fang, Xinyin Ma, Michael Bi Mi, Xinchao WangFirst submitted…
LayerShuffle: Enhancing Robustness in Vision Transformers by Randomizing Layer Execution Orderby Matthias Freiberger, Peter Kun,…
Functional Faithfulness in the Wild: Circuit Discovery with Differentiable Computation Graph Pruningby Lei Yu, Jingcheng…
An Efficient Framework for Crediting Data Contributors of Diffusion Modelsby Chris Lin, Mingyu Lu, Chanwoo…
Joint Pruning and Channel-wise Mixed-Precision Quantization for Efficient Deep Neural Networksby Beatrice Alessandra Motetti, Matteo…
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costsby Enshu…
Efficient Automated Circuit Discovery in Transformers using Contextual Decompositionby Aliyah R. Hsu, Georgia Zhou, Yeshwanth…
Reliable edge machine learning hardware for scientific applicationsby Tommaso Baldi, Javier Campos, Ben Hawks, Jennifer…