Summary of Preference Poisoning Attacks on Reward Model Learning, by Junlin Wu et al.
Preference Poisoning Attacks on Reward Model Learningby Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang,…
Preference Poisoning Attacks on Reward Model Learningby Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang,…
A General Framework for Learning from Weak Supervisionby Hao Chen, Jindong Wang, Lei Feng, Xiang…
Robust Counterfactual Explanations in Machine Learning: A Surveyby Junqi Jiang, Francesco Leofante, Antonio Rago, Francesca…
OPSurv: Orthogonal Polynomials Quadrature Algorithm for Survival Analysisby Lilian W. Bialokozowicz, Hoang M. Le, Tristan…
Calibrated Uncertainty Quantification for Operator Learning via Conformal Predictionby Ziqi Ma, Kamyar Azizzadenesheli, Anima AnandkumarFirst…
Position Paper: Assessing Robustness, Privacy, and Fairness in Federated Learning Integrated with Foundation Modelsby Xi…
Socially Aware Synthetic Data Generation for Suicidal Ideation Detection Using Large Language Modelsby Hamideh Ghanadian,…
Systematic Literature Review: Computational Approaches for Humour Style Classificationby Mary Ogbuka Kenneth, Foaad Khosmood, Abbas…
Rethinking Interpretability in the Era of Large Language Modelsby Chandan Singh, Jeevana Priya Inala, Michel…
Disentangling the Roles of Target-Side Transfer and Regularization in Multilingual Machine Translationby Yan Meng, Christof…