Alert button
Picture for Arvind Mahankali

Arvind Mahankali

Alert button

One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention

Add code
Bookmark button
Alert button
Jul 07, 2023
Arvind Mahankali, Tatsunori B. Hashimoto, Tengyu Ma

Viaarxiv icon

Beyond NTK with Vanilla Gradient Descent: A Mean-Field Analysis of Neural Networks with Polynomial Width, Samples, and Time

Add code
Bookmark button
Alert button
Jun 28, 2023
Arvind Mahankali, Jeff Z. Haochen, Kefan Dong, Margalit Glasgow, Tengyu Ma

Viaarxiv icon