Summary of CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers, by Adjorn van Engelenhoven et al.
CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers by Adjorn van Engelenhoven, Nicola Strisciuglio, Estefanía…