Summary of Beexformer: a Fast Inferencing Transformer Architecture Via Binarization with Multiple Early Exits, by Wazib Ansar et al.
BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exitsby Wazib Ansar, Saptarsi…