Summary of Addressing Representation Collapse in Vector Quantized Models with One Linear Layer, by Yongxin Zhu et al.
Addressing Representation Collapse in Vector Quantized Models with One Linear Layerby Yongxin Zhu, Bocheng Li,…
Addressing Representation Collapse in Vector Quantized Models with One Linear Layerby Yongxin Zhu, Bocheng Li,…
Conformalized High-Density Quantile Regression via Dynamic Prototypes-based Probability Density Estimationby Batuhan Cengiz, Halil Faruk Karagoz,…
Abstracted Shapes as Tokens – A Generalizable and Interpretable Model for Time-series Classificationby Yunshi Wen,…
Accelerated AI Inference via Dynamic Execution Methodsby Haim Barad, Jascha Achterberg, Tien Pei Chou, Jean…
GWQ: Gradient-Aware Weight Quantization for Large Language Modelsby Yihua Shao, Siyu Liang, Zijian Ling, Minxi…
ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNsby Yuchen Yang, Shubham Ugare,…
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environmentsby Xinghao Wang, Pengyu Wang,…
Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Modelby Wenjia Xie,…
Data Generation for Hardware-Friendly Post-Training Quantizationby Lior Dikstein, Ariel Lapid, Arnon Netzer, Hai Victor HabiFirst…
The Impact of Inference Acceleration on Bias of LLMsby Elisabeth Kirsten, Ivan Habernal, Vedant Nanda,…