Summary of Code-switching Red-teaming: Llm Evaluation For Safety and Multilingual Understanding, by Haneul Yoo et al.
Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understandingby Haneul Yoo, Yongjin Yang, Hwaran LeeFirst…
Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understandingby Haneul Yoo, Yongjin Yang, Hwaran LeeFirst…
PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preferenceby Jiaming Ji, Donghai Hong, Borong…
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Modelby Min Zhao, Hongzhou Zhu, Chendong…
Safe Inputs but Unsafe Output: Benchmarking Cross-modality Safety Alignment of Large Vision-Language Modelby Siyin Wang,…
Camera-Invariant Meta-Learning Network for Single-Camera-Training Person Re-identificationby Jiangbo Pei, Zhuqing Jiang, Aidong Men, Haiying Wang,…
GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Modelsby Leyan Wang, Yonggang…
Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract…
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Datasetby Josef Dai, Tianle…
GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Modelsby Tao Zhang, Ziqian…
ViLCo-Bench: VIdeo Language COntinual learning Benchmarkby Tianqi Tang, Shohreh Deldari, Hao Xue, Celso De Melo,…