Summary of Scaling Up Masked Diffusion Models on Text, by Shen Nie et al.
Scaling up Masked Diffusion Models on Textby Shen Nie, Fengqi Zhu, Chao Du, Tianyu Pang,…
Scaling up Masked Diffusion Models on Textby Shen Nie, Fengqi Zhu, Chao Du, Tianyu Pang,…
Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Databy Anup Shirgaonkar,…
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planningby Jiacheng Ye, Jiahui Gao, Shansan Gong,…
H2OVL-Mississippi Vision Language Models Technical Reportby Shaikat Galib, Shanshan Wang, Guanshuo Xu, Pascal Pfeiffer, Ryan…
Enhancing Text Generation in Joint NLG/NLU Learning Through Curriculum Learning, Semi-Supervised Training, and Advanced Optimization…
TextLap: Customizing Language Models for Text-to-Layout Planningby Jian Chen, Ruiyi Zhang, Yufan Zhou, Jennifer Healey,…
TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretrainingby Ruiyi Zhang, Sai Ashish Somayajula, Pengtao XieFirst submitted…
MTL-LoRA: Low-Rank Adaptation for Multi-Task Learningby Yaming Yang, Dilxat Muhtar, Yelong Shen, Yuefeng Zhan, Jianfeng…
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptationby Fabian Paischer, Lukas Hauzenberger,…
Selective Aggregation for Low-Rank Adaptation in Federated Learningby Pengxin Guo, Shuang Zeng, Yanran Wang, Huijie…