Alert button

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

Jan 30, 2024
Shun Zhang, Zhenfang Chen, Sunli Chen, Yikang Shen, Zhiqing Sun, Chuang Gan

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: