Summary of Exploring the Landscape Of Large Language Models: Foundations, Techniques, and Challenges, by Milad Moradi et al.
Exploring the landscape of large language models: Foundations, techniques, and challengesby Milad Moradi, Ke Yan,…
Exploring the landscape of large language models: Foundations, techniques, and challengesby Milad Moradi, Ke Yan,…
X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learnerby Haoyuan…
Learn to Tour: Operator Design For Solution Feasibility Mapping in Pickup-and-delivery Traveling Salesman Problemby Bowen…
Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinationsby Christian Tomani, Kamalika Chaudhuri, Ivan Evtimov,…
Simplex Decomposition for Portfolio Allocation Constraints in Reinforcement Learningby David Winkel, Niklas Strauß, Matthias Schubert,…
N-Agent Ad Hoc Teamworkby Caroline Wang, Arrasy Rahman, Ishan Durugkar, Elad Liebman, Peter StoneFirst submitted…
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RLby Fangwei Zhong, Kui Wu,…
Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMsby Ruoxi Cheng, Haoxuan…
Improving Language Model Reasoning with Self-motivated Learningby Yunlong Feng, Yang Xu, Libo Qin, Yasheng Wang,…
Towards Understanding the Influence of Reward Margin on Preference Model Performanceby Bowen Qin, Duanyu Feng,…