Alert button
Picture for Xiong-Hui Chen

Xiong-Hui Chen

Alert button

Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

Add code
Bookmark button
Alert button
Apr 14, 2024
Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu

Viaarxiv icon

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

Add code
Bookmark button
Alert button
Oct 09, 2023
Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, Yang Yu

Figure 1 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 2 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 3 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 4 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Viaarxiv icon

Language Model Self-improvement by Reinforcement Learning Contemplation

Add code
Bookmark button
Alert button
May 23, 2023
Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu

Figure 1 for Language Model Self-improvement by Reinforcement Learning Contemplation
Figure 2 for Language Model Self-improvement by Reinforcement Learning Contemplation
Figure 3 for Language Model Self-improvement by Reinforcement Learning Contemplation
Figure 4 for Language Model Self-improvement by Reinforcement Learning Contemplation
Viaarxiv icon

Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems

Add code
Bookmark button
Alert button
May 03, 2023
Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Qin, Wenjie Shang, Jieping Ye, Chen Ma

Figure 1 for Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems
Figure 2 for Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems
Figure 3 for Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems
Figure 4 for Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems
Viaarxiv icon

A Survey on Model-based Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 19, 2022
Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Figure 1 for A Survey on Model-based Reinforcement Learning
Viaarxiv icon

Adversarial Counterfactual Environment Model Learning

Add code
Bookmark button
Alert button
Jun 10, 2022
Xiong-Hui Chen, Yang Yu, Zheng-Mao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Hongqiu Wu, Rong-Jun Qin, Ruijin Ding, Fangsheng Huang

Figure 1 for Adversarial Counterfactual Environment Model Learning
Figure 2 for Adversarial Counterfactual Environment Model Learning
Figure 3 for Adversarial Counterfactual Environment Model Learning
Figure 4 for Adversarial Counterfactual Environment Model Learning
Viaarxiv icon

Offline Reinforcement Learning with Causal Structured World Models

Add code
Bookmark button
Alert button
Jun 03, 2022
Zheng-Mao Zhu, Xiong-Hui Chen, Hong-Long Tian, Kun Zhang, Yang Yu

Figure 1 for Offline Reinforcement Learning with Causal Structured World Models
Figure 2 for Offline Reinforcement Learning with Causal Structured World Models
Figure 3 for Offline Reinforcement Learning with Causal Structured World Models
Figure 4 for Offline Reinforcement Learning with Causal Structured World Models
Viaarxiv icon