Alert button
Picture for Moonkyung Ryu

Moonkyung Ryu

Alert button

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Add code
Bookmark button
Alert button
May 25, 2023
Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee

Figure 1 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 2 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 3 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 4 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Viaarxiv icon

Aligning Text-to-Image Models using Human Feedback

Add code
Bookmark button
Alert button
Feb 23, 2023
Kimin Lee, Hao Liu, Moonkyung Ryu, Olivia Watkins, Yuqing Du, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Shixiang Shane Gu

Figure 1 for Aligning Text-to-Image Models using Human Feedback
Figure 2 for Aligning Text-to-Image Models using Human Feedback
Figure 3 for Aligning Text-to-Image Models using Human Feedback
Figure 4 for Aligning Text-to-Image Models using Human Feedback
Viaarxiv icon

Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 25, 2022
Deborah Cohen, Moonkyung Ryu, Yinlam Chow, Orgad Keller, Ido Greenberg, Avinatan Hassidim, Michael Fink, Yossi Matias, Idan Szpektor, Craig Boutilier, Gal Elidan

Figure 1 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Figure 2 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Figure 3 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Figure 4 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Viaarxiv icon

CAQL: Continuous Action Q-Learning

Add code
Bookmark button
Alert button
Oct 09, 2019
Moonkyung Ryu, Yinlam Chow, Ross Anderson, Christian Tjandraatmadja, Craig Boutilier

Figure 1 for CAQL: Continuous Action Q-Learning
Figure 2 for CAQL: Continuous Action Q-Learning
Figure 3 for CAQL: Continuous Action Q-Learning
Figure 4 for CAQL: Continuous Action Q-Learning
Viaarxiv icon