Summary of Seal: Safety-enhanced Aligned Llm Fine-tuning Via Bilevel Data Selection, by Han Shen and Pin-yu Chen and Payel Das and Tianyi Chen
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selectionby Han Shen, Pin-Yu Chen, Payel Das,…