
Tianbing Xu

WALL-E: An Efficient Reinforcement Learning Research Framework

Jan 28, 2019
Tianbing Xu, Andrew Zhang, Liang Zhao

Via arXiv

Stochastic Variance Reduction for Policy Gradient Estimation

Mar 29, 2018
Tianbing Xu, Qiang Liu, Jian Peng

Via arXiv

Learning to Explore with Meta-Policy Gradient

Mar 26, 2018
Tianbing Xu, Qiang Liu, Liang Zhao, Jian Peng

Via arXiv

Variational Inference for Policy Gradient

Mar 25, 2018
Tianbing Xu

Via arXiv

Thompson Sampling in Dynamic Systems for Contextual Bandit Problems

Oct 17, 2013
Tianbing Xu, Yaming Yu, John Turner, Amelia Regan

Via arXiv

Online Classification Using a Voted RDA Method

Oct 17, 2013
Tianbing Xu, Jianfeng Gao, Lin Xiao, Amelia Regan

Via arXiv