Summary of Ve: Modeling Multivariate Time Series Correlation with Variate Embedding, by Shangjiong Wang et al.
VE: Modeling Multivariate Time Series Correlation with Variate Embeddingby Shangjiong Wang, Zhihong Man, Zhenwei Cao,…
VE: Modeling Multivariate Time Series Correlation with Variate Embeddingby Shangjiong Wang, Zhihong Man, Zhenwei Cao,…
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruningby Jaeseong Lee, seung-won hwang, Aurick Qiao, Daniel F…
Alt-MoE:A Scalable Framework for Bidirectional Multimodal Alignment and Efficient Knowledge Integrationby Hongyang Lei, Xiaolong Cheng,…
Interpretable mixture of experts for time series prediction under recurrent and non-recurrent conditionsby Zemian Ke,…
OLMoE: Open Mixture-of-Experts Language Modelsby Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Jacob Morrison,…
Beyond Parameter Count: Implicit Bias in Soft Mixture of Expertsby Youngseog Chung, Dhruv Malik, Jeff…
Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Expertsby Lean Wang, Huazuo Gao, Chenggang Zhao, Xu Sun, Damai…
Advancing Enterprise Spatio-Temporal Forecasting Applications: Data Mining Meets Instruction Tuning of Language Models For Multi-modal…
The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies,…
SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Mergingby Mohammadreza Pourreza,…