Large language model – Page 74

July 13, 2025

Library Learning Doesn’t: The Curious Case of the Single-Use “Library” by Ian Berlot-Attwell, Frank Rudzicz, Xujie…

July 13, 2025

Notes on the Mathematical Structure of GPT LLM Architectures by Spencer Becker-Kahn. First submitted to arXiv on:…

July 13, 2025

Inference time LLM alignment in single and multidomain preference spectrum by Sadat Shahriar, Zheng Qi, Nikolaos…

July 13, 2025

Research on Key Technologies for Cross-Cloud Federated Training of Large Language Models by Haowei Yang, Mingxiu…

July 13, 2025

Dynamic Vocabulary Pruning in Early-Exit LLMs by Jort Vincenti, Karim Abdel Sadek, Joan Velja, Matteo Nulli,…

July 13, 2025

Unbounded: A Generative Infinite Game of Character Life Simulation by Jialu Li, Yuanzhen Li, Neal Wadhwa,…

July 13, 2025

Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms by Zhangheng Li, Keen You, Haotian Zhang,…

July 13, 2025

BATON: Enhancing Batch-wise Inference Efficiency for Large Language Models via Dynamic Re-batching by Peizhuang Cong, Qizhi…

July 13, 2025

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs by Ankit…

July 13, 2025

POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference by Aditya K Kamath, Ramya Prabhu, Jayashree…