Summary of Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Attack, by Tiansheng Huang et al.
Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attackby Tiansheng Huang, Sihao Hu,…