Alert button
Picture for Joanna Yoo

Joanna Yoo

Alert button

Scalable Training of Language Models using JAX pjit and TPUv4

Add code
Bookmark button
Alert button
Apr 13, 2022
Joanna Yoo, Kuba Perlin, Siddhartha Rao Kamalakara, João G. M. Araújo

Figure 1 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 2 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 3 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 4 for Scalable Training of Language Models using JAX pjit and TPUv4
Viaarxiv icon

SliceOut: Training Transformers and CNNs faster while using less memory

Add code
Bookmark button
Alert button
Jul 21, 2020
Pascal Notin, Aidan N. Gomez, Joanna Yoo, Yarin Gal

Figure 1 for SliceOut: Training Transformers and CNNs faster while using less memory
Figure 2 for SliceOut: Training Transformers and CNNs faster while using less memory
Figure 3 for SliceOut: Training Transformers and CNNs faster while using less memory
Figure 4 for SliceOut: Training Transformers and CNNs faster while using less memory
Viaarxiv icon