Summary of Weakly Supervised Deep Hyperspherical Quantization For Image Retrieval, by Jinpeng Wang et al.
Weakly Supervised Deep Hyperspherical Quantization for Image Retrievalby Jinpeng Wang, Bin Chen, Qiang Zhang, Zaiqiao…
Weakly Supervised Deep Hyperspherical Quantization for Image Retrievalby Jinpeng Wang, Bin Chen, Qiang Zhang, Zaiqiao…
Accurate Block Quantization in LLMs with Outliersby Nikita Trukhanov, Ilya SoloveychikFirst submitted to arxiv on:…
QNCD: Quantization Noise Correction for Diffusion Modelsby Huanpeng Chu, Wei Wu, Chengjie Zang, Kun YuanFirst…
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compressionby Junyuan Hong, Jinhao Duan,…
UniCode: Learning a Unified Codebook for Multimodal Large Language Modelsby Sipeng Zheng, Bohan Zhou, Yicheng…
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intactby Ruikang Liu, Haoli Bai,…
Evaluating Quantized Large Language Modelsby Shiyao Li, Xuefei Ning, Luning Wang, Tengxuan Liu, Xiangsheng Shi,…
A Comprehensive Evaluation of Quantization Strategies for Large Language Modelsby Renren Jin, Jiangcun Du, Wuwei…
LLM Inference Unveiled: Survey and Roofline Model Insightsby Zhihang Yuan, Yuzhang Shang, Yang Zhou, Zhen…
Towards Accurate Post-training Quantization for Reparameterized Modelsby Luoming Zhang, Yefei He, Wen Fei, Zhenyu Lou,…