Summary of Investigating Neuron Ablation in Attention Heads: The Case for Peak Activation Centering, by Nicholas Pochinkov et al.
Investigating Neuron Ablation in Attention Heads: The Case for Peak Activation Centering by Nicholas Pochinkov, Ben…
The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information by Diyuan Wu, Ionut-Vlad…
Modularity in Transformers: Investigating Neuron Separability & Specialization by Nicholas Pochinkov, Thomas Jones, Mohammed Rashidur Rahman First…
An Empirical Study of Scaling Laws for Transfer by Matthew Barnett First submitted to arXiv on: 30…
Error-controlled non-additive interaction discovery in machine learning models by Winston Chen, Yifan Jiang, William Stafford Noble,…
HLogformer: A Hierarchical Transformer for Representing Log Data by Zhichao Hou, Mina Ghashami, Mikhail Kuznetsov, MohamadAli…
On-device AI: Quantization-aware Training of Transformers in Time-Series by Tianheng Ling, Gregor Schiele First submitted to arXiv…
Large-Scale Multi-omic Biosequence Transformers for Modeling Peptide-Nucleotide Interactions by Sully F. Chen, Robert J. Steele, Beakal…
TCNFormer: Temporal Convolutional Network Former for Short-Term Wind Speed Forecasting by Abid Hasan Zim, Aquib Iqbal,…
Automatic Differential Diagnosis using Transformer-Based Multi-Label Sequence Classification by Abu Adnan Sadi, Mohammad Ashrafuzzaman Khan, Lubaba…