Summary of LQER: Low-Rank Quantization Error Reconstruction for LLMs, by Cheng Zhang et al.
LQER: Low-Rank Quantization Error Reconstruction for LLMs by Cheng Zhang, Jianyi Cheng, George A. Constantinides, Yiren…
Locally-Adaptive Quantization for Streaming Vector Search by Cecilia Aguerrebere, Mark Hildebrand, Ishwar Singh Bhati, Theodore Willke,…
Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning by Guangfeng Yan, Tan Li, Yuanzhang…
Large Language Models for Time Series: A Survey by Xiyuan Zhang, Ranak Roy Chowdhury, Rajesh K.…
SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding by Chanho Park, Namyoon Lee First…
Truncated Non-Uniform Quantization for Distributed SGD by Guangfeng Yan, Tan Li, Yuanzhang Xiao, Congduan Li, Linqi…
HW-SW Optimization of DNNs for Privacy-preserving People Counting on Low-resolution Infrared Arrays by Matteo Risso, Chen…
FedShift: Robust Federated Learning Aggregation Scheme in Resource Constrained Environment via Weight Shifting by Jungwon Seo,…
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization by Coleman Hooper, Sehoon…
Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs by Dingyi Dai, Yichi Zhang, Jiahao Zhang,…