Summary of FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training, by Philip Zmushko et al.
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training, by Philip Zmushko, Aleksandr Beznosikov, Martin…