Summary of Crossquant: a Post-training Quantization Method with Smaller Quantization Kernel For Precise Large Language Model Compression, by Wenyuan Liu et al.
CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compressionby…