Summary of Adaptive Stream Processing on Edge Devices Through Active Inference, by Boris Sedlak et al.
Adaptive Stream Processing on Edge Devices through Active Inferenceby Boris Sedlak, Victor Casamayor Pujol, Andrea…
Adaptive Stream Processing on Edge Devices through Active Inferenceby Boris Sedlak, Victor Casamayor Pujol, Andrea…
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioningby Soeun Lee, Si-Woo Kim, Taewhan…
Recent advances in interpretable machine learning using structure-based protein representationsby Luiz Felipe Vecchietti, Minji Lee,…
Conjugate Bayesian Two-step Change Point Detection for Hawkes Processby Zeyue Zhang, Xiaoling Lu, Feng ZhouFirst…
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inferenceby Qining Zhang, Lei…
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reductionby Zhenmei…
Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesisby Chirag Vashist, Shichong Peng, Ke…
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Modelsby Gongfan Fang, Hongxu Yin, Saurav Muralidharan, Greg…
Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximationsby Amey…
INT-FlashAttention: Enabling Flash Attention for INT8 Quantizationby Shimao Chen, Zirui Liu, Zhiying Wu, Ce Zheng,…