Jonathan Hseu

Reducing BERT Pre-Training Time from 3 Days to 76 Minutes

Apr 01, 2019
Yang You, Jing Li, Jonathan Hseu, Xiaodan Song, James Demmel, Cho-Jui Hsieh

Via arXiv

Large-Batch Training for LSTM and Beyond

Jan 24, 2019
Yang You, Jonathan Hseu, Chris Ying, James Demmel, Kurt Keutzer, Cho-Jui Hsieh

Via arXiv