Summary of Safety Alignment Backfires: Preventing the Re-emergence Of Suppressed Concepts in Fine-tuned Text-to-image Diffusion Models, by Sanghyun Kim et al.
Safety Alignment Backfires: Preventing the Re-emergence of Suppressed Concepts in Fine-tuned Text-to-Image Diffusion Modelsby Sanghyun…