Alert button
Picture for Zixuan Dong

Zixuan Dong

Alert button

Cross Entropy versus Label Smoothing: A Neural Collapse Perspective

Add code
Bookmark button
Alert button
Feb 07, 2024
Li Guo, Keith Ross, Zifan Zhao, George Andriopoulos, Shuyang Ling, Yufeng Xu, Zixuan Dong

Viaarxiv icon

Pre-training with Synthetic Data Helps Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 06, 2023
Zecheng Wang, Che Wang, Zixuan Dong, Keith Ross

Figure 1 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 2 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 3 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 4 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Viaarxiv icon

On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs

Add code
Bookmark button
Alert button
Sep 07, 2022
Zixuan Dong, Che Wang, Keith Ross

Figure 1 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 2 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 3 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 4 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Viaarxiv icon