Summary of Buster: Implanting Semantic Backdoor Into Text Encoder to Mitigate Nsfw Content Generation, by Xin Zhao et al.

Buster: Implanting Semantic Backdoor into Text Encoder to Mitigate NSFW Content Generation

by Xin Zhao, Xiaojun Chen, Yuexin Xuan, Zhendong Zhao, Xiaojun Jia, Xinfeng Li, Xiaofeng Wang

First submitted to arxiv on: 10 Dec 2024

GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty	Written by	Summary
High	Paper authors	High Difficulty Summary Read the original abstract here
Medium	GrooveSquid.com (original content)	Medium Difficulty Summary The proposed framework, called Buster, addresses concerns about Not-Safe-for-Work (NSFW) content generation in deep learning models by injecting backdoors into text encoders to prevent harmful content creation. This innovative approach leverages deep semantic information rather than explicit prompts as triggers, redirecting NSFW prompts towards targeted benign prompts. Buster also employs energy-based training data generation through Langevin dynamics for adversarial knowledge augmentation, ensuring robustness in defining harmful concepts. The framework fine-tunes text encoders of Text-to-Image models efficiently and outperforms nine state-of-the-art baselines, achieving a superior NSFW content removal rate of at least 91.2% while preserving harmless image quality.
Low	GrooveSquid.com (original content)	Low Difficulty Summary Buster is a new way to stop bad images from being made on the internet. Right now, people use special tricks to make sure no naughty pictures are created, but these methods can be slow and not very good at catching all the bad stuff. Buster works differently by adding secret codes to the computer’s brain so it knows when someone is trying to make a yucky image. This helps keep the internet clean and safe for everyone.

Keywords

» Artificial intelligence » Deep learning

Buster: Implanting Semantic Backdoor into Text Encoder to Mitigate NSFW Content Generation

by Xin Zhao, Xiaojun Chen, Yuexin Xuan, Zhendong Zhao, Xiaojun Jia, Xinfeng Li, Xiaofeng Wang

Categories

GrooveSquid.com Paper Summaries

Keywords

Summary of A Dynamical Systems-inspired Pruning Strategy For Addressing Oversmoothing in Graph Neural Networks, by Biswadeep Chakraborty et al.

Summary of Real-time Sign Language Recognition Using Mobilenetv2 and Transfer Learning, by Smruti Jagtap et al.

Related Posts