Summary of Hydra: Sequentially-dependent Draft Heads For Medusa Decoding, by Zachary Ankner et al.
Hydra: Sequentially-Dependent Draft Heads for Medusa Decodingby Zachary Ankner, Rishab Parthasarathy, Aniruddha Nrusimha, Christopher Rinard,…
Hydra: Sequentially-Dependent Draft Heads for Medusa Decodingby Zachary Ankner, Rishab Parthasarathy, Aniruddha Nrusimha, Christopher Rinard,…
Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generationby Lingxiao Zhao, Xueying Ding, Leman AkogluFirst submitted to…
MobilityGPT: Enhanced Human Mobility Modeling with a GPT modelby Ammar Haydari, Dongjie Chen, Zhengfeng Lai,…
Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learningby Abdelhakim Benechehab, Albert…
Arithmetic in Transformers Explainedby Philip Quirke, Clement Neo, Fazl BarezFirst submitted to arxiv on: 4…
AutoTimes: Autoregressive Time Series Forecasters via Large Language Modelsby Yong Liu, Guo Qin, Xiangdong Huang,…
Break the Sequential Dependency of LLM Inference Using Lookahead Decodingby Yichao Fu, Peter Bailis, Ion…
Neural Language of Thought Modelsby Yi-Fu Wu, Minseung Lee, Sungjin AhnFirst submitted to arxiv on:…
Causal Coordinated Concurrent Reinforcement Learningby Tim Tse, Isaac Chan, Zhitang ChenFirst submitted to arxiv on:…
Arrows of Time for Large Language Modelsby Vassilis Papadopoulos, Jérémie Wenger, Clément HonglerFirst submitted to…