Summary of Residual Vector Quantization for KV Cache Compression in Large Language Model, by Ankur Kumar
Residual vector quantization for KV cache compression in large language model, by Ankur Kumar. First submitted to…
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces, by Jifeng Hu, Sili Huang,…
Estimating Individual Dose-Response Curves under Unobserved Confounders from Observational Data, by Shutong Chen, Yang Li. First submitted…
Offline reinforcement learning for job-shop scheduling problems, by Imanol Echeverria, Maialen Murua, Roberto Santana. First submitted to…
Traffic Matrix Estimation based on Denoising Diffusion Probabilistic Model, by Xinyu Yuan, Yan Qiao, Pei Zhao,…
A Two-Stage Learning-to-Defer Approach for Multi-Task Learning, by Yannis Montreuil, Shu Heng Yeo, Axel Carlier, Lai…
S-CFE: Simple Counterfactual Explanations, by Shpresim Sadiku, Moritz Wagner, Sai Ganesh Nagarajan, Sebastian Pokutta. First submitted to…
Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases, by Cristian Meo, Akihiro Nakano, Mircea Lică, Aniket…
DeepVigor+: Scalable and Accurate Semi-Analytical Fault Resilience Analysis for Deep Neural Network, by Mohammad Hasan Ahmadilivani,…
Optimal Query Allocation in Extractive QA with LLMs: A Learning-to-Defer Framework with Theoretical Guarantees, by Yannis…