Summary of Learn to Be Efficient: Build Structured Sparsity in Large Language Models, by Haizhong Zheng et al.
Learn To be Efficient: Build Structured Sparsity in Large Language Modelsby Haizhong Zheng, Xiaoyan Bai,…
Learn To be Efficient: Build Structured Sparsity in Large Language Modelsby Haizhong Zheng, Xiaoyan Bai,…
Anatomizing Deep Learning Inference in Web Browsersby Qipeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao,…
Latent variable model for high-dimensional point process with structured missingnessby Maksim Sinelnikov, Manuel Haussmann, Harri…
RepQuant: Towards Accurate Post-Training Quantization of Large Transformer Models via Scale Reparameterizationby Zhikai Li, Xuewen…
Mitigating Privacy Risk in Membership Inference by Convex-Concave Lossby Zhenlong Liu, Lei Feng, Huiping Zhuang,…
Linearizing Models for Efficient yet Robust Private Inferenceby Sreetama Sarkar, Souvik Kundu, Peter A. BeerelFirst…
Interactive Symbolic Regression through Offline Reinforcement Learning: A Co-Design Frameworkby Yuan Tian, Wenqi Zhou, Michele…
Bellman Conformal Inference: Calibrating Prediction Intervals For Time Seriesby Zitong Yang, Emmanuel Candès, Lihua LeiFirst…
Improved off-policy training of diffusion samplersby Marcin Sendera, Minsu Kim, Sarthak Mittal, Pablo Lemos, Luca…
Hydragen: High-Throughput LLM Inference with Shared Prefixesby Jordan Juravsky, Bradley Brown, Ryan Ehrlich, Daniel Y.…