Summary of Generating and Evolving Reward Functions For Highway Driving with Large Language Models, by Xu Han et al.
Generating and Evolving Reward Functions for Highway Driving with Large Language Modelsby Xu Han, Qiannan…
Generating and Evolving Reward Functions for Highway Driving with Large Language Modelsby Xu Han, Qiannan…
Prompt-Based Length Controlled Generation with Multiple Control Typesby Renlong Jie, Xiaojun Meng, Lifeng Shang, Xin…
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Modelsby Carson Denison, Monte MacDiarmid, Fazl Barez,…
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithmsby Miaosen Zhang, Yixuan Wei,…
SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasetsby Shenghua Wan, Ziyuan Chen,…
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectorsby Zhenglong Luo, Zhiyong Chen, James WelshFirst…
EXPIL: Explanatory Predicate Invention for Learning in Gamesby Jingyuan Sha, Hikaru Shindo, Quentin Delfosse, Kristian…
Multi-attribute Auction-based Resource Allocation for Twins Migration in Vehicular Metaverses: A GPT-based DRL Approachby Yongju…
Diffusion-based Reinforcement Learning for Dynamic UAV-assisted Vehicle Twins Migration in Vehicular Metaversesby Yongju Tong, Jiawen…
Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learningby…