Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks

Sep 26, 2019
Ziwei Ji, Matus Telgarsky
