Summary of Mira: a Method Of Federated Multi-task Learning For Large Language Models, by Ahmed Elbakary et al.
MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Modelsby Ahmed Elbakary, Chaouki Ben…
MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Modelsby Ahmed Elbakary, Chaouki Ben…
Collaboratively adding new knowledge to an LLMby Rhui Dih Lee, Laura WynterFirst submitted to arxiv…
Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problemsby Bingcong Li, Liang Zhang, Niao HeFirst submitted…
QuAILoRA: Quantization-Aware Initialization for LoRAby Neal Lawton, Aishwarya Padmakumar, Judith Gaspers, Jack FitzGerald, Anoop Kumar,…
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Modelby ZiDong Wang, Zeyu Lu, Di…
MoR: Mixture of Ranks for Low-Rank Adaptation Tuningby Chuanyu Tang, Yilong Chen, Zhenyu Zhang, Junyuan…
LoRA Soups: Merging LoRAs for Practical Skill Composition Tasksby Akshara Prabhakar, Yuanzhi Li, Karthik Narasimhan,…
In-context KV-Cache Eviction for LLMs via Attention-Gateby Zihao Zeng, Bokai Lin, Tianqi Hou, Hao Zhang,…
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Modelsby Hossein Abdi, Mingfei Sun, Andi…
AT-MoE: Adaptive Task-planning Mixture of Experts via LoRA Approachby Xurui Li, Juanjuan YaoFirst submitted to…