Alert button
Picture for Chenjun Xiao

Chenjun Xiao

Alert button

An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models

Add code
Bookmark button
Alert button
Apr 23, 2024
Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr

Viaarxiv icon

Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 20, 2023
Hongming Zhang, Tongzheng Ren, Chenjun Xiao, Dale Schuurmans, Bo Dai

Viaarxiv icon

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 01, 2023
Yi Ma, Chenjun Xiao, Hebin Liang, Jianye Hao

Viaarxiv icon

In-Sample Policy Iteration for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 09, 2023
Xiaohan Hu, Yi Ma, Chenjun Xiao, Yan Zheng, Zhaopeng Meng

Figure 1 for In-Sample Policy Iteration for Offline Reinforcement Learning
Figure 2 for In-Sample Policy Iteration for Offline Reinforcement Learning
Figure 3 for In-Sample Policy Iteration for Offline Reinforcement Learning
Figure 4 for In-Sample Policy Iteration for Offline Reinforcement Learning
Viaarxiv icon

Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 16, 2023
Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran

Figure 1 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 2 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 3 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 4 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

The In-Sample Softmax for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 28, 2023
Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White

Figure 1 for The In-Sample Softmax for Offline Reinforcement Learning
Figure 2 for The In-Sample Softmax for Offline Reinforcement Learning
Figure 3 for The In-Sample Softmax for Offline Reinforcement Learning
Figure 4 for The In-Sample Softmax for Offline Reinforcement Learning
Viaarxiv icon

Latent Variable Representation for Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 17, 2022
Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai

Figure 1 for Latent Variable Representation for Reinforcement Learning
Figure 2 for Latent Variable Representation for Reinforcement Learning
Figure 3 for Latent Variable Representation for Reinforcement Learning
Figure 4 for Latent Variable Representation for Reinforcement Learning
Viaarxiv icon

Understanding the Effect of Stochasticity in Policy Optimization

Add code
Bookmark button
Alert button
Oct 29, 2021
Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvari, Dale Schuurmans

Figure 1 for Understanding the Effect of Stochasticity in Policy Optimization
Figure 2 for Understanding the Effect of Stochasticity in Policy Optimization
Figure 3 for Understanding the Effect of Stochasticity in Policy Optimization
Viaarxiv icon

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data

Add code
Bookmark button
Alert button
Jun 18, 2021
Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvari

Figure 1 for On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data
Figure 2 for On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data
Viaarxiv icon

On the Optimality of Batch Policy Optimization Algorithms

Add code
Bookmark button
Alert button
Apr 06, 2021
Chenjun Xiao, Yifan Wu, Tor Lattimore, Bo Dai, Jincheng Mei, Lihong Li, Csaba Szepesvari, Dale Schuurmans

Figure 1 for On the Optimality of Batch Policy Optimization Algorithms
Figure 2 for On the Optimality of Batch Policy Optimization Algorithms
Viaarxiv icon