Summary of Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference, by Rohan Baskar Prabhakar et al.
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference
by Rohan Baskar Prabhakar, Hengrui Zhang, David Wentzlaff
First…