Alert button

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

Add code
Bookmark button
Alert button
Nov 15, 2023
Zhexin Zhang, Junxiao Yang, Pei Ke, Minlie Huang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: