Summary of Mixture Of a Million Experts, by Xu Owen He
Mixture of A Million Expertsby Xu Owen HeFirst submitted to arxiv on: 4 Jul 2024CategoriesMain:…
Mixture of A Million Expertsby Xu Owen HeFirst submitted to arxiv on: 4 Jul 2024CategoriesMain:…
Terminating Differentiable Tree Expertsby Jonathan Thomm, Michael Hersche, Giacomo Camposampiero, Aleksandar Terzić, Bernhard Schölkopf, Abbas…
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Modelsby…
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costsby Enshu…
A Teacher Is Worth A Million Instructionsby Nikhil Kothari, Ravindra Nayak, Shreyas Shetty, Amey Patil,…
A Closer Look into Mixture-of-Experts in Large Language Modelsby Ka Man Lo, Zeyu Huang, Zihan…
Peirce in the Machine: How Mixture of Experts Models Perform Hypothesis Constructionby Bruce RushingFirst submitted…
Theory on Mixture-of-Experts in Continual Learningby Hongbo Li, Sen Lin, Lingjie Duan, Yingbin Liang, Ness…
Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Predictionby Wenzhao Jiang, Jindong Han, Hao Liu, Tao…
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Expertsby Haoxiang Wang, Wei Xiong, Tengyang Xie, Han…