Summary of Sambalingo: Teaching Large Language Models New Languages, by Zoltan Csaki et al.
SambaLingo: Teaching Large Language Models New Languagesby Zoltan Csaki, Bo Li, Jonathan Li, Qiantong Xu,…
SambaLingo: Teaching Large Language Models New Languagesby Zoltan Csaki, Bo Li, Jonathan Li, Qiantong Xu,…
Enhancing Inference Efficiency of Large Language Models: Investigating Optimization Strategies and Architectural Innovationsby Georgy TyukinFirst…
Does Biomedical Training Lead to Better Medical Performance?by Amin Dada, Marie Bauer, Amanda Butler Contreras,…
ROPO: Robust Preference Optimization for Large Language Modelsby Xize Liang, Chao Chen, Shuang Qiu, Jie…
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skippingby Ajay Jaiswal, Bodun…
Linguistic Calibration of Long-Form Generationsby Neil Band, Xuechen Li, Tengyu Ma, Tatsunori HashimotoFirst submitted to…
Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Scienceby Yazheng…
Latxa: An Open Language Model and Evaluation Suite for Basqueby Julen Etxaniz, Oscar Sainz, Naiara…
Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Modelsby Ang Lv, Yuhan Chen, Kaiyi…
ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Cachingby Youpeng Zhao, Di Wu, Jun…