Summary of Drive: Dual Gradient-based Rapid Iterative Pruning, by Dhananjay Saikumar et al.
DRIVE: Dual Gradient-Based Rapid Iterative Pruningby Dhananjay Saikumar, Blesson VargheseFirst submitted to arxiv on: 1…
DRIVE: Dual Gradient-Based Rapid Iterative Pruningby Dhananjay Saikumar, Blesson VargheseFirst submitted to arxiv on: 1…
A Layer Selection Approach to Test Time Adaptationby Sabyasachi Sahoo, Mostafa ElAraby, Jonas Ngnawe, Yann…
Training LLMs over Neurally Compressed Textby Brian Lester, Jaehoon Lee, Alex Alemi, Jeffrey Pennington, Adam…
Federated Unlearning for Human Activity Recognitionby Kongyang Chen, Dongping zhang, Yaping Chai, Weibin Zhang, Shaowei…
RL for Consistency Models: Faster Reward Guided Text-to-Image Generationby Owen Oertell, Jonathan D. Chang, Yiyi…
Optimizing the Deployment of Tiny Transformers on Low-Power MCUsby Victor J.B. Jung, Alessio Burrello, Moritz…
Automatic Prompt Selection for Large Language Modelsby Viet-Tung Do, Van-Khanh Hoang, Duy-Hung Nguyen, Shahab Sabahi,…
On-line conformalized neural networks ensembles for probabilistic forecasting of day-ahead electricity pricesby Alessandro Brusaferri, Andrea…
Toward Inference-optimal Mixture-of-Expert Large Language Modelsby Longfei Yun, Yonghao Zhuang, Yao Fu, Eric P Xing,…
Guarantees of confidentiality via Hammersley-Chapman-Robbins boundsby Kamalika Chaudhuri, Chuan Guo, Laurens van der Maaten, Saeed…