Summary of Rethinking Optimization and Architecture For Tiny Language Models, by Yehui Tang et al.
Rethinking Optimization and Architecture for Tiny Language Modelsby Yehui Tang, Fangcheng Liu, Yunsheng Ni, Yuchuan…
Rethinking Optimization and Architecture for Tiny Language Modelsby Yehui Tang, Fangcheng Liu, Yunsheng Ni, Yuchuan…
CroissantLLM: A Truly Bilingual French-English Language Modelby Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, António…
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalitiesby Yiyuan Zhang, Xiaohan Ding, Kaixiong…
How Can Large Language Models Understand Spatial-Temporal Data?by Lei Liu, Shuo Yu, Runze Wang, Zhenxun…
RoBERTurk: Adjusting RoBERTa for Turkishby Nuri TasFirst submitted to arxiv on: 7 Jan 2024CategoriesMain: Computation…