Recent analyses of neural networks with shaped activations (i.e., the activation function is scaled as the network size grows) have led to scaling limits described by differential equations. However, these results do not a priori tell us anything about "ordinary" unshaped networks, where the activation is unchanged as the network size grows. In this article, we find similar differential-equation-based asymptotic characterizations for two types of unshaped networks. First, we show that the following two architectures converge to the same infinite-depth-and-width limit at initialization: (i) a fully connected ResNet with a $d^{-1/2}$ factor on the residual branch, where $d$ is the network depth; and (ii) a multilayer perceptron (MLP) with depth...
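For concreteness, architecture (i) can be sketched as below: a residual MLP at random Gaussian initialization whose residual branch carries a $d^{-1/2}$ factor. This is a minimal illustrative sketch, not the paper's code; the width, depth, tanh activation, and $N(0, 1/\text{width})$ weight scaling are assumptions chosen only to make the scaling visible.

```python
import numpy as np

def resnet_forward(x, depth=200, rng=None):
    """Forward pass of a residual MLP at initialization:
        h_{l+1} = h_l + depth**-0.5 * phi(W_l @ h_l),
    with i.i.d. weights W_l ~ N(0, 1/width) (illustrative assumption).
    """
    rng = rng or np.random.default_rng(0)
    width = x.shape[0]
    phi = np.tanh  # unshaped activation: fixed, not rescaled with network size
    h = x.copy()
    for _ in range(depth):
        W = rng.standard_normal((width, width)) / np.sqrt(width)
        h = h + depth ** -0.5 * phi(W @ h)  # d^{-1/2} factor on the residual branch
    return h

# Example: the hidden-state norm stays O(1) as depth grows, unlike an
# unscaled residual stack.
x = np.random.default_rng(1).standard_normal(256)
print(np.linalg.norm(x), np.linalg.norm(resnet_forward(x, depth=400)))
```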
This paper underlines a subtle property of batch-normalization (BN): Successiv...
Recent work by Jacot et al. (2018) has shown that training a neural network using gradient descent i...
Deep feedforward networks initialized along the edge of chaos exhibit exponentially superior trainin...
We investigate the asymptotic properties of deep Residual networks (ResNets) as the number of layers...
The logit outputs of a feedforward neural network at initialization are conditionally Gaussian, give...
We develop a mathematically rigorous framework for multilayer neural networks in the mean field regi...
Deep ResNets are recognized for achieving state-of-the-art results in complex machine learning tasks...
We contribute to a better understanding of the class of functions that can be represented by a neura...
In this note, we study how neural networks with a single hidden layer and ReLU activation interpolat...
We study the effect of normalization on the layers of deep neural networks of feed-forward type. A g...
We establish in this work approximation results of deep neural networks for smooth functions measure...
It took until the last decade to finally see a machine match human performance on essentially any ta...
In supervised learning, the regularization path is sometimes used as a convenient theoretical proxy ...
Deep neural networks (DNNs) defy the classical bias-variance trade-off: adding parameters to a DNN t...
Given any deep fully connected neural network, initialized with random Gaussian parameters, we bound...