Summary of Evaluating the Generalization Ability Of Quantized Llms: Benchmark, Analysis, and Toolbox, by Yijun Liu et al.
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolboxby Yijun Liu, Yuan Meng,…
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolboxby Yijun Liu, Yuan Meng,…
Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Modelsby Dongwon Jo, Taesu Kim, Yulhwa…
Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantizationby Seungwoo Son, Wonpyo…
ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinkingby Wenshuo Li, Xinghao Chen, Han Shu,…
QTIP: Quantization with Trellises and Incoherence Processingby Albert Tseng, Qingyao Sun, David Hou, Christopher De…
Promoting Data and Model Privacy in Federated Learning through Quantized LoRAby JianHao Zhu, Changze Lv,…
Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Trainingby Akul…
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Casesby Rithesh Murthy, Liangwei Yang, Juntao Tan,…
Precipitation Nowcasting Using Physics Informed Discriminator Generative Modelsby Junzhe Yin, Cristian Meo, Ankush Roy, Zeineh…
QQQ: Quality Quattuor-Bit Quantization for Large Language Modelsby Ying Zhang, Peng Zhang, Mincong Huang, Jingyang…