This book develops an effective theory approach to understanding deep neural networks of practical relevance. Beginning from a first-principles component-level picture of networks, we explain how to determine an accurate description of the output of trained networks by solving layer-to-layer iteration equations and nonlinear learning dynamics. A main result is that the predictions of networks are described by nearly-Gaussian distributions, with the depth-to-width aspect ratio of the network controlling the deviations from the infinite-width Gaussian description. We explain how these effectively-deep networks learn nontrivial representations from training and more broadly analyze the mechanism of representation learning for nonlinear models....
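A hedged sketch of this book's central claim, with notation assumed here rather than quoted from the abstract: at infinite width the distribution of network outputs $z$ over initializations is Gaussian with some kernel $K$, and at large but finite width the leading non-Gaussian correction is controlled by the depth-to-width aspect ratio $L/n$,
\[ p(z) \;\propto\; \exp\!\left[ -\tfrac{1}{2}\, z^{\top} K^{-1} z \;+\; O\!\big(L/n\big) \right], \]
so the Gaussian description is recovered as $L/n \to 0$ and degrades as the network becomes deep relative to its width.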
Deep Gaussian Process (DGP) as a model prior in Bayesian learning intuitively exploits the expressiv...
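For concreteness (a standard construction; the symbols are assumptions, not quoted from the snippet): a depth-$L$ DGP composes layers of Gaussian-process-distributed maps,
\[ f \;=\; f^{(L)} \circ \cdots \circ f^{(1)}, \qquad f^{(\ell)} \sim \mathcal{GP}\big(0,\, k^{(\ell)}\big), \]
so the prior over $f$ is hierarchical and generally non-Gaussian even though each individual layer is a GP.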
We investigate the asymptotic properties of deep Residual networks (ResNets) as the number of layers...
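A common scaling in this line of work (a sketch; the exponent convention is an assumption): the residual update is written as
\[ x_{k+1} \;=\; x_k + L^{-\beta}\, f_k(x_k), \qquad k = 0, \dots, L-1, \]
so that as the number of layers $L \to \infty$ the hidden states converge, depending on $\beta$ and the distribution of the $f_k$, to a deterministic ODE or a stochastic (diffusion) limit.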
The logit outputs of a feedforward neural network at initialization are conditionally Gaussian, give...
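A minimal sketch of the conditional structure (weights $W^{(\ell)}$, nonlinearity $\phi$, and widths $n_\ell$ are assumed notation): with i.i.d. Gaussian weights, the preactivations of layer $\ell+1$ are exactly Gaussian given the activations of layer $\ell$,
\[ z^{(\ell+1)}_{i;\alpha} = \sum_{j=1}^{n_\ell} W^{(\ell)}_{ij}\, \phi\big(z^{(\ell)}_{j;\alpha}\big), \qquad z^{(\ell+1)} \,\big|\, z^{(\ell)} \;\sim\; \mathcal{N}\big(0,\, \widehat{K}^{(\ell)}\big), \]
where $\widehat{K}^{(\ell)}_{\alpha\beta} = \frac{\sigma_w^2}{n_\ell} \sum_j \phi\big(z^{(\ell)}_{j;\alpha}\big)\, \phi\big(z^{(\ell)}_{j;\beta}\big)$ is a random covariance matrix over inputs $\alpha, \beta$.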
The basic structure and definitions of artificial neural networks are presented, as an introduction to...
Deep learning has...
A main puzzle of deep networks revolves around the absence of overfitting despite overparametrizatio...
These lectures, presented at the 2022 Les Houches Summer School on Statistical Physics and Machine L...
In this paper, we consider the generalization ability of deep wide feedforward ReLU neural networks ...
Single-index models are a class of functions given by an unknown univariate ``link'' function applie...
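For concreteness (a standard definition; the symbols are chosen here, not quoted from the snippet): a single-index model applies an unknown univariate link $g$ to a one-dimensional projection of the input,
\[ f(x) \;=\; g\big(\langle w, x \rangle\big), \qquad w \in \mathbb{R}^d,\quad g : \mathbb{R} \to \mathbb{R}, \]
so learning amounts to recovering the direction $w$ together with the link $g$.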
An underlying mechanism for successful deep learning (DL) with a limited deep architecture and datas...
Artificial Neural Networks (ANNs) are complex modelling techniques that can be used to find the rela...
In recent years, Deep Neural Networks (DNNs) have managed to succeed at tasks that previously ap...
Recently proposed deep learning systems can achieve superior performance with respect to methods bas...
This article provides a comprehensive understanding of optimization in deep learning, with a primary...
We analyze feature learning in infinite-width neural networks trained with gradient flow through a s...
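A hedged sketch of the setting (notation assumed): the parameters evolve under gradient flow on the training loss,
\[ \dot{\theta}_t \;=\; -\eta\, \nabla_{\theta}\, \mathcal{L}\big(\theta_t\big), \]
and the question is how, at infinite width, the induced kernels and features move during training rather than staying frozen at their initial values.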