The practice of deep learning has shown that neural networks generalize remarkably well even with an extremely large number of learned parameters. This appears to contradict traditional statistical wisdom, according to which a trade-off between model complexity and fit to the data is essential. We set out to resolve this discrepancy from a convex optimization and sparse recovery perspective. We consider the training and generalization properties of two-layer ReLU networks with standard weight decay regularization. Under certain regularity assumptions on the data, we show that ReLU networks with an arbitrary number of parameters learn only simple models that explain the data. This is analogous to the recovery of the sparsest linear model in compressed sensing...
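For reference, the setup described above is commonly written as the following weight-decay-regularized training objective for a two-layer ReLU network (a minimal sketch; the loss $\ell$, width $m$, and regularization strength $\lambda$ are generic placeholders rather than the paper's specific choices):

$$
\min_{\{w_j,\, v_j\}_{j=1}^{m}} \; \frac{1}{n}\sum_{i=1}^{n} \ell\Big(\sum_{j=1}^{m} v_j \,(w_j^\top x_i)_+ ,\; y_i\Big) \;+\; \frac{\lambda}{2}\sum_{j=1}^{m}\big(\|w_j\|_2^2 + v_j^2\big)
$$

Weight decay applied to both layers of such a network is known to be equivalent, after rescaling the neurons, to an $\ell_1$-type penalty on the output weights, which is what makes the compressed-sensing analogy in the abstract natural.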
Classifiers used in the wild, in particular for safety-critical systems, should not only have good g...
Large neural networks have proved remarkably effective in modern deep learning practice, even in the...
We develop fast algorithms and robust software for convex optimization of two-layer neural networks ...
Given a training set, a loss function, and a neural network architecture, it i...
Today, various forms of neural networks are trained to perform approximation tasks in many fields. H...
Understanding the fundamental principles behind the success of deep neural networks is one of the mo...
Understanding the computational complexity of training simple neural networks with rectified linear ...
In this paper, we consider one dimensional (shallow) ReLU neural networks in which weights are chose...
Convex $\ell_1$ regularization using an infinite dictionary of neurons has been suggested for constr...
Current deep neural networks are highly overparameterized (up to billions of connection weights) and...
Rectified linear units (ReLUs) have become the main model for the neural units in current deep learn...
In this note, we study how neural networks with a single hidden layer and ReLU activation interpolat...
In deep learning it is common to overparameterize neural networks, that is, to use more parameters t...
Neural networks (NNs) have seen a surge in popularity due to their unprecedented practical success i...
Recent theoretical works on over-parameterized neural nets have focused on two aspects: optimization...