Summary of Eagle: Speculative Sampling Requires Rethinking Feature Uncertainty, by Yuhui Li et al.
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertaintyby Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang ZhangFirst…
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertaintyby Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang ZhangFirst…
Adaptive Point Transformerby Alessandro Baiocchi, Indro Spinelli, Alessandro Nicolosi, Simone ScardapaneFirst submitted to arxiv on:…
A structured regression approach for evaluating model performance across intersectional subgroupsby Christine Herlihy, Kimberly Truong,…
Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Modelsby Erik Arakelyan, Zhaoqi Liu,…
Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Databy Yongsheng Mei, Mahdi Imani, Tian…
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cacheby Leyang Xue, Yao Fu,…
ServerlessLLM: Low-Latency Serverless Inference for Large Language Modelsby Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian…
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalitiesby Yiyuan Zhang, Xiaohan Ding, Kaixiong…
CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networksby Andrei Tomut, Saeed S.…
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Designby Haojun Xia, Zhen Zheng, Xiaoxia…