Alert button

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Add code
Bookmark button
Alert button
Sep 19, 2019
Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, Bryan Catanzaro

Figure 1 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 2 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 3 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 4 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: