Critical Data Size of Language Models from a Grokking Perspective

Feb 06, 2024
Xuekai Zhu, Yao Fu, Bowen Zhou, Zhouhan Lin
