Mixture of experts – Page 7

July 13, 2025

HiMoE: Heterogeneity-Informed Mixture-of-Experts for Fair Spatial-Temporal Forecasting by Shaohan Yu, Pan Deng, Yu Zhao, Junting Liu,…

July 13, 2025

Condense, Don’t Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning by Mingyu Cao, Gen…

July 13, 2025

Mixture of Cache-Conditional Experts for Efficient Mobile Device Inference by Andrii Skliar, Ties van Rozendaal, Romain…

July 13, 2025

On the effectiveness of discrete representations in sparse mixture of experts by Giang Do, Kha Pham,…

July 13, 2025

Mixture of Experts in Image Classification: What’s the Sweet Spot? by Mathurin Videau, Alessandro Leite, Marc…

July 13, 2025

LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy by Peng Cui, Yiming Yang, Fusheng Jin,…

July 13, 2025

Ultra-Sparse Memory Network by Zihao Huang, Qiyang Min, Hongzhi Huang, Defa Zhu, Yutao Zeng, Ran Guo,…

July 13, 2025

Weakly-Supervised Multimodal Learning on MIMIC-CXR by Andrea Agostini, Daphné Chopard, Yang Meng, Norbert Fortin, Babak Shahbaba,…

July 13, 2025

Sparse Upcycling: Inference Inefficient Finetuning by Sasha Doubov, Nikhil Sardana, Vitaliy Chiley. First submitted to arXiv on:…

July 13, 2025

Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection by Vima Gupta, Kartik Sinha, Ada…