Rong Bao

Mitigating Reward Hacking via Information-Theoretic Reward Modeling

Feb 16, 2024

Orthogonal Subspace Learning for Language Model Continual Learning

Oct 22, 2023

Robust Lottery Tickets for Pre-trained Language Models

Nov 06, 2022