Summary of Adamoe: Token-adaptive Routing with Null Experts For Mixture-of-experts Language Models, by Zihao Zeng et al.
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Modelsby Zihao Zeng, Yibo Miao, Hongcheng…
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Modelsby Zihao Zeng, Yibo Miao, Hongcheng…
NeuroMoCo: A Neuromorphic Momentum Contrast Learning Method for Spiking Neural Networksby Yuqi Ma, Huamin Wang,…
TTM-RE: Memory-Augmented Document-Level Relation Extractionby Chufan Gao, Xuan Wang, Jimeng SunFirst submitted to arxiv on:…
No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMsby…
Direct Alignment of Language Models via Quality-Aware Self-Refinementby Runsheng Yu, Yong Wang, Xiaoqi Jiao, Youzhi…
CV-VAE: A Compatible Video VAE for Latent Generative Video Modelsby Sijie Zhao, Yong Zhang, Xiaodong…
Instruction Tuning With Loss Over Instructionsby Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison,…
Rethink Predicting the Optical Flow with the Kinetics Perspectiveby Yuhao Cheng, Siru Zhang, Yiqiang YanFirst…
Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Modelsby Jiaqi Li, Qianshan Wei,…
Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systemsby Shengxiang Sun, Shenzhe ZhuFirst submitted to arxiv…