Alert button
Picture for Lei Guan

Lei Guan

Alert button

PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction

Add code
Bookmark button
Alert button
Dec 05, 2023
Lei Guan, Dongsheng Li, Jiye Liang, Wenjian Wang, Xicheng Lu

Viaarxiv icon

AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis

Add code
Bookmark button
Alert button
Sep 05, 2023
Lei Guan

Figure 1 for AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
Figure 2 for AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
Figure 3 for AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
Figure 4 for AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
Viaarxiv icon

XGrad: Boosting Gradient-Based Optimizers With Weight Prediction

Add code
Bookmark button
Alert button
May 26, 2023
Lei Guan, Dongsheng Li, Jian Meng, Yanqi Shi

Figure 1 for XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
Figure 2 for XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
Figure 3 for XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
Figure 4 for XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
Viaarxiv icon

Weight Prediction Boosts the Convergence of AdamW

Add code
Bookmark button
Alert button
Feb 01, 2023
Lei Guan

Figure 1 for Weight Prediction Boosts the Convergence of AdamW
Figure 2 for Weight Prediction Boosts the Convergence of AdamW
Figure 3 for Weight Prediction Boosts the Convergence of AdamW
Figure 4 for Weight Prediction Boosts the Convergence of AdamW
Viaarxiv icon

XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Add code
Bookmark button
Alert button
Nov 20, 2019
Lei Guan, Wotao Yin, Dongsheng Li, Xicheng Lu

Figure 1 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 2 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 3 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 4 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Viaarxiv icon

Non-ergodic Convergence Analysis of Heavy-Ball Algorithms

Add code
Bookmark button
Alert button
Nov 09, 2018
Tao Sun, Penghang Yin, Dongsheng Li, Chun Huang, Lei Guan, Hao Jiang

Figure 1 for Non-ergodic Convergence Analysis of Heavy-Ball Algorithms
Viaarxiv icon

An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines

Add code
Bookmark button
Alert button
Sep 11, 2018
Lei Guan, Linbo Qiao, Dongsheng Li, Tao Sun, Keshi Ge, Xicheng Lu

Figure 1 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Figure 2 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Figure 3 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Figure 4 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Viaarxiv icon