Summary of Lillama: Large Language Models Compression via Low-Rank Feature Distillation, by Yaya Sy, Christophe Cerisara, and Irina Illina
Lillama: Large Language Models Compression via Low-Rank Feature Distillation, by Yaya Sy, Christophe Cerisara, Irina Illina. First…