Fine-Tuning Language Models from Human Preferences

Sep 18, 2019
Daniel M. Ziegler, Nisan Stiennon, Jeffrey Wu, Tom B. Brown, Alec Radford, Dario Amodei, Paul Christiano, Geoffrey Irving


View paper on arXiv