Summary of Learning from Response not Preference: A Stackelberg Approach for LLM Detoxification using Non-parallel Data, by Xinhong Xie et al.