Summary of Atoxia: Red-teaming Large Language Models with Target Toxic Answers, by Yuhao Du et al.
Atoxia: Red-teaming Large Language Models with Target Toxic Answersby Yuhao Du, Zhuo Li, Pengyu Cheng,…
Atoxia: Red-teaming Large Language Models with Target Toxic Answersby Yuhao Du, Zhuo Li, Pengyu Cheng,…
Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RLby Jihwan Lee, Woochang…
DynamicRouteGPT: A Real-Time Multi-Vehicle Dynamic Navigation Framework Based on Large Language Modelsby Ziai Zhou, Bin…
Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning…
DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimationby Xiaowei Mao, Yan Lin, Shengnan Guo, Yubin…
Intelligent OPC Engineer Assistant for Semiconductor Manufacturingby Guojin Chen, Haoyu Yang, Bei Yu, Haoxing RenFirst…
S-EPOA: Overcoming the Indistinguishability of Segments with Skill-Driven Preference-Based Reinforcement Learningby Ni Mu, Yao Luan,…
Bridging Large Language Models and Optimization: A Unified Framework for Text-attributed Combinatorial Optimizationby Xia Jiang,…
SCREENER: A general framework for task-specific experiment design in quantitative MRIby Tianshu Zheng, Zican Wang,…
Advances in Preference-based Reinforcement Learning: A Reviewby Youssef Abdelkareem, Shady Shehata, Fakhri KarrayFirst submitted to…