Summary of Library Learning Doesn’t: the Curious Case Of the Single-use “library”, by Ian Berlot-attwell et al.
Library Learning Doesn’t: The Curious Case of the Single-Use “Library”by Ian Berlot-Attwell, Frank Rudzicz, Xujie…
Library Learning Doesn’t: The Curious Case of the Single-Use “Library”by Ian Berlot-Attwell, Frank Rudzicz, Xujie…
Notes on the Mathematical Structure of GPT LLM Architecturesby Spencer Becker-KahnFirst submitted to arxiv on:…
Inference time LLM alignment in single and multidomain preference spectrumby Sadat Shahriar, Zheng Qi, Nikolaos…
Research on Key Technologies for Cross-Cloud Federated Training of Large Language Modelsby Haowei Yang, Mingxiu…
Dynamic Vocabulary Pruning in Early-Exit LLMsby Jort Vincenti, Karim Abdel Sadek, Joan Velja, Matteo Nulli,…
Unbounded: A Generative Infinite Game of Character Life Simulationby Jialu Li, Yuanzhen Li, Neal Wadhwa,…
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platformsby Zhangheng Li, Keen You, Haotian Zhang,…
BATON: Enhancing Batch-wise Inference Efficiency for Large Language Models via Dynamic Re-batchingby Peizhuang Cong, Qizhi…
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMsby Ankit…
POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inferenceby Aditya K Kamath, Ramya Prabhu, Jayashree…