Summary of Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks, by Andy Zhou and Bo Li and Haohan Wang
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacksby Andy Zhou, Bo Li, Haohan…