Fanzhuang Meng

AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster

Apr 15, 2024
Siyuan Li, Youshao Xiao, Fanzhuang Meng, Lin Ju, Lei Liang, Lin Wang, Jun Zhou

Via arXiv

Rethinking Memory and Communication Cost for Efficient Large Language Model Training

Oct 09, 2023
Chan Wu, Hanxiao Zhang, Lin Ju, Jinjing Huang, Youshao Xiao, Zhaoxin Huan, Siyuan Li, Fanzhuang Meng, Lei Liang, Xiaolu Zhang, Jun Zhou

Via arXiv