Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels

Apr 22, 2024
Jan-Philipp Fränken, Eric Zelikman, Rafael Rafailov, Kanishk Gandhi, Tobias Gerstenberg, Noah D. Goodman
