Summary of RegMix: Data Mixture as Regression for Language Model Pre-training, by Qian Liu et al.
RegMix: Data Mixture as Regression for Language Model Pre-training, by Qian Liu, Xiaosen Zheng, Niklas Muennighoff,…
MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting, by Tianhao…
Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving, by Ran Tian,…
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents, by Shihan Deng, Weikai Xu, Hongda Sun, Wei…
Into the Unknown: Generating Geospatial Descriptions for New Environments, by Tzuf Paz-Argaman, John Palowitch, Sayali Kulkarni,…
Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification, by Anisha Gunjal, Greg Durrett. First submitted to…
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization, by Miyoung Ko, Sue…
AutoPureData: Automated Filtering of Undesirable Web Data to Update LLM Knowledge, by Praneeth Vadlapati. First submitted to…
IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language, by Lucky Susanto,…
MammothModa: Multi-Modal Large Language Model, by Qi She, Junwen Pan, Xin Wan, Rui Zhang, Dawei Lu,…