Summary of Reinforcement Learning with Token-level Feedback For Controllable Text Generation, by Wendi Li et al.
Reinforcement Learning with Token-level Feedback for Controllable Text Generationby Wendi Li, Wei Wei, Kaihe Xu,…
Reinforcement Learning with Token-level Feedback for Controllable Text Generationby Wendi Li, Wei Wei, Kaihe Xu,…
Semiparametric Token-Sequence Co-Supervisionby Hyunji Lee, Doyoung Kim, Jihoon Jun, Sejune Joo, Joel Jang, Kyoung-Woon On,…
UniCode: Learning a Unified Codebook for Multimodal Large Language Modelsby Sipeng Zheng, Bohan Zhou, Yicheng…
Token Alignment via Character Matching for Subword Completionby Ben Athiwaratkun, Shiqi Wang, Mingyue Shang, Yuchen…
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Modelsby Ning Ding, Yulin…
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLMby Sainbayar Sukhbaatar, Olga Golovneva, Vasu Sharma, Hu…
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Documentby Yuliang Liu, Biao Yang, Qiang Liu,…
ToolNet: Connecting Large Language Models with Massive Tools via Tool Graphby Xukun Liu, Zhiyuan Peng,…
Resonance RoPE: Improving Context Length Generalization of Large Language Modelsby Suyuchen Wang, Ivan Kobyzev, Peng…