Bowen Yu

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

May 28, 2024
Via arXiv

Language Models can Evaluate Themselves via Probability Discrepancy

May 17, 2024
Via arXiv

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

Mar 30, 2024
Via arXiv

SoFA: Shielded On-the-fly Alignment via Priority Rule Following

Feb 27, 2024
Via arXiv

Self-Retrieval: Building an Information Retrieval System with One Large Language Model

Feb 23, 2024
Via arXiv

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

Jan 23, 2024
Via arXiv

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

Nov 06, 2023
Via arXiv

Diversify Question Generation with Retrieval-Augmented Style Transfer

Oct 23, 2023
Via arXiv

Improving Question Generation with Multi-level Content Planning

Oct 23, 2023
Via arXiv

Quantifying and mitigating the impact of label errors on model disparity metrics

Oct 04, 2023
Via arXiv