Summary of Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers, by Yingyu Liang et al.
Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers by Yingyu Liang,…