Summary of Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning, by Yifan Yang et al.
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuningby Yifan Yang, Kai Zhen,…