Summary of Teaching Large Language Models to Reason with Reinforcement Learning, by Alex Havrilla et al.
Teaching Large Language Models to Reason with Reinforcement Learningby Alex Havrilla, Yuqing Du, Sharath Chandra…
Teaching Large Language Models to Reason with Reinforcement Learningby Alex Havrilla, Yuqing Du, Sharath Chandra…
Advancing Out-of-Distribution Detection through Data Purification and Dynamic Activation Function Designby Yingrui Ji, Yao Zhu,…
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projectionby Jiawei Zhao, Zhenyu Zhang, Beidi Chen, Zhangyang…
MathScale: Scaling Instruction Tuning for Mathematical Reasoningby Zhengyang Tang, Xingxing Zhang, Benyou Wang, Furu WeiFirst…
Android in the Zoo: Chain-of-Action-Thought for GUI Agentsby Jiwen Zhang, Jihao Wu, Yihua Teng, Minghui…
Enhancing LLM Safety via Constrained Direct Preference Optimizationby Zixuan Liu, Xiaolin Sun, Zizhan ZhengFirst submitted…
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Modelsby Saeed Najafi, Alona FysheFirst…
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Modelsby…
TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language Modelsby Yilong Ren, Yue Chen,…
ComS2T: A complementary spatiotemporal learning system for data-adaptive model evolutionby Zhengyang Zhou, Qihe Huang, Binwu…