Picture for Jinghui Chen

Jinghui Chen

PromptFix: Few-shot Backdoor Removal via Adversarial Prompt Tuning

Jun 06, 2024
Viaarxiv icon

Graph Adversarial Diffusion Convolution

Add code
Jun 04, 2024
Viaarxiv icon

XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution

May 30, 2024
Viaarxiv icon

Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization

May 28, 2024
Viaarxiv icon

WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response

Add code
May 22, 2024
Viaarxiv icon

VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models

Feb 16, 2024
Viaarxiv icon

Federated Learning with Projected Trajectory Regularization

Dec 22, 2023
Viaarxiv icon

On the Difficulty of Defending Contrastive Learning against Backdoor Attacks

Dec 14, 2023
Viaarxiv icon

Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections

Nov 15, 2023
Viaarxiv icon

IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI

Add code
Oct 30, 2023
Viaarxiv icon