Summary of Scaling Laws For Multilingual Language Models, by Yifei He et al.
Scaling Laws for Multilingual Language Modelsby Yifei He, Alon Benhaim, Barun Patra, Praneetha Vaddamanu, Sanchit…
Scaling Laws for Multilingual Language Modelsby Yifei He, Alon Benhaim, Barun Patra, Praneetha Vaddamanu, Sanchit…
Adaptive Data Optimization: Dynamic Sample Selection with Scaling Lawsby Yiding Jiang, Allan Zhou, Zhili Feng,…
TSDS: Data Selection for Task-Specific Model Finetuningby Zifan Liu, Amin Karbasi, Theodoros RekatsinasFirst submitted to…
Fine-tuning can Help Detect Pretraining Data from Large Language Modelsby Hengxiang Zhang, Songxin Zhang, Bingyi…
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learningby Etai Littwin, Vimal Thilak, Anand…
GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed Graphsby Yun Zhu, Haizhou Shi, Xiaotang…
GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluationby Taha Aksu, Gerald Woo, Juncheng…
LoLCATs: On Low-Rank Linearizing of Large Language Modelsby Michael Zhang, Simran Arora, Rahul Chalamala, Alan…
TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretrainingby Ruiyi Zhang, Sai Ashish Somayajula, Pengtao XieFirst submitted…
Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defenseby Rui Min, Zeyu Qin, Nevin…