Summary of KTO: Model Alignment as Prospect Theoretic Optimization, by Kawin Ethayarajh et al.
KTO: Model Alignment as Prospect Theoretic Optimization by Kawin Ethayarajh, Winnie Xu, Niklas Muennighoff, Dan Jurafsky,…
SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding by Chanho Park, Namyoon Lee First…
Fundamental Properties of Causal Entropy and Information Gain by Francisco N. F. Q. Simoes, Mehdi Dastani,…
Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion by Zexi…
Monotone, Bi-Lipschitz, and Polyak-Lojasiewicz Networks by Ruigang Wang, Krishnamurthy Dvijotham, Ian R. Manchester First submitted to arXiv…
Shapelet-based Model-agnostic Counterfactual Local Explanations for Time Series Classification by Qi Huang, Wei Chen, Thomas Bäck,…
CORE: Mitigating Catastrophic Forgetting in Continual Learning through Cognitive Replay by Jianshu Zhang, Yankai Fu, Ziheng…
pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning by Liping Yi, Han…
Continual Learning for Large Language Models: A Survey by Tongtong Wu, Linhao Luo, Yuan-Fang Li, Shirui…
TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time (Extended Version) by Zeliang Kan,…