Alert button

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Add code
Bookmark button
Alert button
Jul 19, 2023
Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Nanning Zheng, Furu Wei

Figure 1 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 2 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 3 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 4 for LongNet: Scaling Transformers to 1,000,000,000 Tokens

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: