Summary of Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior, by Mingyuan Yan et al.
Gaussian Mixture Vector Quantization with Aggregated Categorical Posteriorby Mingyuan Yan, Jiawei Wu, Rushi Shah, Dianbo…
Gaussian Mixture Vector Quantization with Aggregated Categorical Posteriorby Mingyuan Yan, Jiawei Wu, Rushi Shah, Dianbo…
SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compressionby Mohammad Mozaffari, Amir…
FlatQuant: Flatness Matters for LLM Quantizationby Yuxuan Sun, Ruikang Liu, Haoli Bai, Han Bao, Kang…
QEFT: Quantization for Efficient Fine-Tuning of LLMsby Changhun Lee, Jun-gyu Jin, Younghyun Cho, Eunhyeok ParkFirst…
DeltaDQ: Ultra-High Delta Compression for Fine-Tuned LLMs via Group-wise Dropout and Separate Quantizationby Yanfeng Jiang,…
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generationby Jiatao Gu, Yuyang Wang, Yizhe Zhang, Qihang…
Scalable Representation Learning for Multimodal Tabular Transactionsby Natraj Raman, Sumitra Ganesh, Manuela VelosoFirst submitted to…
CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compressionby…
Scaling Laws for Mixed quantization in Large Language Modelsby Zeyu Cao, Cheng Zhang, Pedro Gimenes,…
Covering Numbers for Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regressionby Weigutian…