Alert button

Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge

Apr 08, 2024
Weikai Lu, Ziqian Zeng, Jianwei Wang, Zhengdong Lu, Zelin Chen, Huiping Zhuang, Cen Chen

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: