Summary of Leanquant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid, by Tianyi Zhang et al.
LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Gridby Tianyi Zhang, Anshumali ShrivastavaFirst…