Summary of Mamba-ptq: Outlier Channels in Recurrent Large Language Models, by Alessandro Pierro et al.
Mamba-PTQ: Outlier Channels in Recurrent Large Language Modelsby Alessandro Pierro, Steven AbreuFirst submitted to arxiv…
Mamba-PTQ: Outlier Channels in Recurrent Large Language Modelsby Alessandro Pierro, Steven AbreuFirst submitted to arxiv…
Team up GBDTs and DNNs: Advancing Efficient and Effective Tabular Prediction with Tree-hybrid MLPsby Jiahuan…
Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Expertsby Zeliang Zhang, Xiaodong Liu, Hao…
Weight Block Sparsity: Training, Compilation, and AI Engine Acceleratorsby Paolo D'Alberto, Taehee Jeong, Akshai Jain,…
Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Modelsby Mohammadreza Tayaranian, Seyyed Hasan Mozafari, Brett…
Characterizing Prompt Compression Methods for Long Context Inferenceby Siddharth Jha, Lutfi Eren Erdogan, Sehoon Kim,…
Explaining Graph Neural Networks for Node Similarity on Graphsby Daniel Daza, Cuong Xuan Chu, Trung-Kien…
Graph Anomaly Detection with Noisy Labels by Reinforcement Learningby Zhu Wang, Shuang Zhou, Junnan Dong,…
Pruning One More Token is Enough: Leveraging Latency-Workload Non-Linearities for Vision Transformers on the Edgeby…
DMTG: One-Shot Differentiable Multi-Task Groupingby Yuan Gao, Shuguo Jiang, Moran Li, Jin-Gang Yu, Gui-Song XiaFirst…