Summary of Bisup: Bidirectional Quantization Error Suppression For Large Language Models, by Minghui Zou et al.
BiSup: Bidirectional Quantization Error Suppression for Large Language Modelsby Minghui Zou, Ronghui Guo, Sai Zhang,…
BiSup: Bidirectional Quantization Error Suppression for Large Language Modelsby Minghui Zou, Ronghui Guo, Sai Zhang,…
OAC: Output-adaptive Calibration for Accurate Post-training Quantizationby Ali Edalati, Alireza Ghaffari, Masoud Asgharian, Lu Hou,…
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Modelsby Wei Huang, Haotong Qin, Yangdong Liu, Yawei…
TerDiT: Ternary Diffusion Models with Transformersby Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu, Yuhui…
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMsby Jaewoo Yang, Hayun Kim, Younghoon…
Deep Learning Methods for Adjusting Global MFD Speed Estimations to Local Link Configurationsby Zhixiong Jin,…
A Practice in Enrollment Prediction with Markov Chain Modelsby Yan Zhao, Amy OttesonFirst submitted to…
Rehearsal-free Federated Domain-incremental Learningby Rui Sun, Haoran Duan, Jiahua Dong, Varun Ojha, Tejal Shah, Rajiv…
Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsamplingby Shifan Zhao, Jiaying Lu,…
Challenging Gradient Boosted Decision Trees with Tabular Transformers for Fraud Detection at Booking.comby Sergei Krutikov,…