Summary of Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs, by Alexander von Recum et al.