Summary of Energy-Based Diffusion Language Models for Text Generation, by Minkai Xu et al.
Energy-Based Diffusion Language Models for Text Generation, by Minkai Xu, Tomas Geffner, Karsten Kreis, Weili Nie,…
Peptide-GPT: Generative Design of Peptides using Generative Pre-trained Transformers and Bio-informatic Supervision, by Aayush Shah, Chakradhar…
TesseraQ: Ultra Low-Bit LLM Post-Training Quantization with Block Reconstruction, by Yuhang Li, Priyadarshini Panda. First submitted to…
FedBaF: Federated Learning Aggregation Biased by a Foundation Model, by Jong-Ik Park, Srinivasa Pranav, José M.…
Methods of improving LLM training stability, by Oleg Rybakov, Mike Chrzanowski, Peter Dykas, Jinze Xue, Ben…
Compute-Constrained Data Selection, by Junjie Oscar Yin, Alexander M. Rush. First submitted to arXiv on: 21 Oct…
A Realistic Threat Model for Large Language Model Jailbreaks, by Valentyn Boreiko, Alexander Panfilov, Vaclav Voracek,…
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning, by Arijit Das. First submitted to arXiv…
CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts, by Zhenpeng Su, Xing…
QuAILoRA: Quantization-Aware Initialization for LoRA, by Neal Lawton, Aishwarya Padmakumar, Judith Gaspers, Jack FitzGerald, Anoop Kumar,…