Summary of MoDeGPT: Modular Decomposition for Large Language Model Compression, by Chi-Heng Lin et al.
MoDeGPT: Modular Decomposition for Large Language Model Compression by Chi-Heng Lin, Shangqian Gao, James Seale Smith,…
A Mean Field Ansatz for Zero-Shot Weight Transfer by Xingyuan Chen, Wenwei Kuang, Lei Deng, Wei…
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations by Xiaochuang Han, Marjan Ghazvininejad, Pang Wei…
Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning…
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment by Karel D'Oosterlinck, Winnie Xu, Chris…
Eigen Attention: Attention in Low-Rank Space for KV Cache Compression by Utkarsh Saxena, Gobinda Saha, Sakshi…
BA-LoRA: Bias-Alleviating Low-Rank Adaptation to Mitigate Catastrophic Inheritance in Large Language Models by Yupeng Chang, Yi…
Efficacy of Large Language Models in Systematic Reviews by Aaditya Shah, Shridhar Mehendale, Siddha Kanthi. First submitted…
Risks, Causes, and Mitigations of Widespread Deployments of Large Language Models (LLMs): A Survey by Md…
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters by Vasudev Shyam, Jonathan Pilault, Emily…