Summary of Just Say What You Want: Only-prompting Self-rewarding Online Preference Optimization, by Ruijie Xu et al.
Just Say What You Want: Only-prompting Self-rewarding Online Preference Optimizationby Ruijie Xu, Zhihan Liu, Yongfei…
Just Say What You Want: Only-prompting Self-rewarding Online Preference Optimizationby Ruijie Xu, Zhihan Liu, Yongfei…
CleanerCLIP: Fine-grained Counterfactual Semantic Augmentation for Backdoor Defense in Contrastive Learningby Yuan Xun, Siyuan Liang,…
REAL: Response Embedding-based Alignment for LLMsby Honggen Zhang, Xufeng Zhao, Igor Molybog, June ZhangFirst submitted…
Post-hoc Reward Calibration: A Case Study on Length Biasby Zeyu Huang, Zihan Qiu, Zili Wang,…
Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn…
Explaining Human Comparisons using Alignment-Importance Heatmapsby Nhut Truong, Dario Pesenti, Uri HassonFirst submitted to arxiv…
Mitigating Semantic Leakage in Cross-lingual Embeddings via Orthogonality Constraintby Dayeon Ki, Cheonbok Park, Hyunjoong KimFirst…
StarVid: Enhancing Semantic Alignment in Video Diffusion Models via Spatial and SynTactic Guided Attention Refocusingby…
TS-HTFA: Advancing Time Series Forecasting via Hierarchical Text-Free Alignment with Large Language Modelsby Pengfei Wang,…