Summary of Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking, by Benjamin Feuer et al.
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking, by Benjamin Feuer, Micah Goldblum,…
Dynamic Integration of Task-Specific Adapters for Class Incremental Learning, by Jiashuo Li, Shaokun Wang, Bo Qian,…
Backtracking Improves Generation Safety, by Yiming Zhang, Jianfeng Chi, Hailey Nguyen, Kartikeya Upasani, Daniel M. Bikel,…
The FIX Benchmark: Extracting Features Interpretable to eXperts, by Helen Jin, Shreya Havaldar, Chaehyeon Kim, Anton…
RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion, by Yinmin Zhong,…
Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner, by Yuzhang Shang, Bingxin Xu, Weitai Kang,…
Extracting Memorized Training Data via Decomposition, by Ellen Su, Anu Vellore, Amy Chang, Raffaele Mura, Blaine…
LogoRA: Local-Global Representation Alignment for Robust Time Series Classification, by Huanyu Zhang, Yi-Fan Zhang, Zhang Zhang,…
Unsupervised Domain Adaptation Via Data Pruning, by Andrea Napoli, Paul White. First submitted to arXiv on: 18…
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey, by Genta Indra…