Summary of Token-efficient Leverage Learning in Large Language Models, by Yuanhao Zeng et al.
Token-Efficient Leverage Learning in Large Language Modelsby Yuanhao Zeng, Min Wang, Yihang Wang, Yingxia ShaoFirst…
Token-Efficient Leverage Learning in Large Language Modelsby Yuanhao Zeng, Min Wang, Yihang Wang, Yingxia ShaoFirst…
A General and Efficient Training for Transformer via Token Expansionby Wenxuan Huang, Yunhang Shen, Jiao…
On Large Language Models’ Hallucination with Regard to Known Factsby Che Jiang, Biqing Qi, Xiangyu…
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttentionby Bin Gao, Zhuomin He, Puru…
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selectionby Ali Behrouz, Michele…
Non-Linear Inference Time Intervention: Improving LLM Truthfulnessby Jakub Hoscilowicz, Adam Wiacek, Jan Chojnacki, Adam Cieslak,…
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysisby Badri N. Patro, Suhas…
Transcribing Bengali Text with Regional Dialects to IPA using District Guided Tokensby S M Jishanul…
VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connectionsby Dongqi Fu, Zhigang Hua, Yan Xie, Jin…
Lexicon-Level Contrastive Visual-Grounding Improves Language Modelingby Chengxu Zhuang, Evelina Fedorenko, Jacob AndreasFirst submitted to arxiv…