Summary of CroissantLLM: A Truly Bilingual French-English Language Model, by Manuel Faysse et al.
CroissantLLM: A Truly Bilingual French-English Language Model by Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, António…
ReAGent: A Model-agnostic Feature Attribution Method for Generative Language Models by Zhixue Zhao, Boxuan Shan. First submitted…
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models by Xuchen Pan, Yanxi…
Comparing Template-based and Template-free Language Model Probing by Sagi Shaier, Kevin Bennett, Lawrence E Hunter, Katharina…
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models by Lai Wei, Zhiquan Tan, Chenghai…
ReacLLaMA: Merging chemical and textual information in chemical reactivity AI models by Aline Hartgers, Ramil Nugmanov,…
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble by Shun Zhang, Zhenfang Chen,…
Vocabulary-Defined Semantics: Latent Space Clustering for Improving In-Context Learning by Jian Gu, Aldeida Aleti, Chunyang Chen,…
Routers in Vision Mixture of Experts: An Empirical Study by Tianlin Liu, Mathieu Blondel, Carlos Riquelme,…
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty by Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang. First…