Neural networks require careful weight initialization to prevent signals from exploding or vanishing. Existing initialization schemes solve this problem in specific cases by assuming that the network has a certain activation function or topology. It is difficult to derive such weight initialization strategies, and modern architectures therefore often use these same initialization schemes even though their assumptions do not hold. This paper introduces AutoInit, a weight initialization algorithm that automatically adapts to different neural network architectures. By analytically tracking the mean and variance of signals as they propagate through the network, AutoInit appropriately scales the weights at each layer to avoid exploding or vanishing signals.
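Since the AutoInit implementation itself is not reproduced here, the following is a minimal sketch of the variance-scaling idea the abstract describes, specialized to the simplest case it generalizes (a dense layer followed by ReLU, in the spirit of He initialization). The function `scaled_init` and its `gain` parameter are illustrative assumptions for this sketch, not AutoInit's actual API.

```python
import numpy as np

def scaled_init(fan_in, fan_out, gain=np.sqrt(2.0), rng=None):
    """Draw weights so a dense layer roughly preserves signal variance.

    With W_ij ~ N(0, gain^2 / fan_in), the pre-activation variance is
    Var(Wx) = fan_in * Var(W_ij) * Var(x_i) = gain^2 * Var(x_i);
    gain = sqrt(2) compensates for ReLU discarding half the signal.
    """
    rng = rng or np.random.default_rng()
    std = gain / np.sqrt(fan_in)
    return rng.normal(0.0, std, size=(fan_in, fan_out))

# Sanity check: activations neither explode nor vanish across 50 layers.
x = np.random.default_rng(0).normal(size=(1024, 256))
for _ in range(50):
    x = np.maximum(0.0, x @ scaled_init(x.shape[1], 256))  # dense + ReLU
print(f"activation std after 50 layers: {x.std():.3f}")
```

AutoInit, as described above, generalizes this bookkeeping: rather than assuming one activation function or layer type, it tracks the mean and variance analytically through each operation of an arbitrary architecture and chooses the weight scale per layer accordingly.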
Proper initialization is one of the most important prerequisites for fast convergence of feed-forwar...
Network pruning is a promising avenue for compressing deep neural networks. A typical approach to pr...
A new method of initializing the weights in deep neural networks is proposed. The method follows two...
The importance of weight initialization when building a deep learning model is often underappreciate...
Automated machine learning (AutoML) methods improve upon existing models by optimizing various aspec...
A good weight initialization is crucial to accelerate the convergence of the weights in a neural net...
Training a neural network (NN) depends on multiple factors, including but not limited to the initial...
Proper initialization of neural networks is critical for successful training of their weig...
In this thesis, a method of initializing neural networks with weights transferred from smaller train...
The activation function deployed in a deep neural network has great influence on the performance of ...
Weight initialization of neural networks has an important influence on the learning process, and ...
A neural network is a machine learning algorithm that has been studied since the mid-1900s. Recently, ...
Gradient backpropagation works well only if the initial weights are close to a go...
This paper compares different approaches to the initialization of neural network w...