Summary of Logah: Predicting 774-million-parameter Transformers Using Graph Hypernetworks with 1/100 Parameters, by Xinyu Zhou et al.
LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parametersby Xinyu Zhou, Boris Knyazev, Alexia…