Summary of Toward Generalizable Learning Of All (linear) First-order Methods Via Memory Augmented Transformers, by Sanchayan Dutta (uc Davis) et al.
Toward generalizable learning of all (linear) first-order methods via memory augmented Transformersby Sanchayan Dutta, Suvrit…