Summary of Toward Inference-optimal Mixture-of-expert Large Language Models, by Longfei Yun et al.
Toward Inference-optimal Mixture-of-Expert Large Language Models by Longfei Yun, Yonghao Zhuang, Yao Fu, Eric P Xing,…
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization by Tobias Schnabel, Jennifer…
TWIN-GPT: Digital Twins for Clinical Trials via Large Language Model by Yue Wang, Tianfan Fu, Yinlong…
Explaining Large Language Models Decisions Using Shapley Values by Behnam Mohammadi. First submitted to arXiv on: 29…
Harnessing the Power of Large Language Model for Uncertainty Aware Graph Processing by Zhenyu Qian, Yiming…
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment by Xiao Liu, Xixuan Song, Yuxiao Dong, Jie Tang. First…
Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models by Zhenjiang Mao, Siqi Dai, Yuang…
MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of Large Language Models by Peng Ding,…
Jamba: A Hybrid Transformer-Mamba Language Model by Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan…
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance by Jiasheng Ye, Peiju Liu,…