The Vanishing Gradient Problem (VGP) is a frequently encountered numerical problem in the training of Feedforward Neural Networks (FNNs) and Recurrent Neural Networks (RNNs). The gradient involved in neural network optimisation can vanish, that is, become numerically zero, in a number of ways. In this thesis we focus on the following definition of the VGP: the tendency of the network loss gradients, calculated with respect to the model weight parameters, to vanish numerically during the backpropagation step of network training. Because the two types of networks are trained on different kinds of data, their architectures differ; consequently, the methods used to alleviate the problem take different forms and focus on different model components. This thesis at...
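To make the definition above concrete, the following is a minimal NumPy sketch, not taken from the thesis: the depth, width, initialisation scale and toy squared-error loss are illustrative assumptions. It builds a deep sigmoid feedforward network, runs backpropagation by hand, and prints the norm of the loss gradient with respect to each layer's weights; the norms shrink roughly geometrically towards the input layer, which is precisely the numerical vanishing referred to above.

```python
# Minimal sketch (illustrative assumptions only) of vanishing loss gradients
# with respect to layer weights in a deep sigmoid feedforward network.
import numpy as np

rng = np.random.default_rng(0)
depth, width = 30, 32

# Random input and a scalar regression target for a toy squared-error loss.
x = rng.standard_normal(width)
target = 1.0

# Illustrative initialisation; each layer is a width x width weight matrix.
Ws = [0.5 * rng.standard_normal((width, width)) for _ in range(depth)]

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Forward pass, storing activations and pre-activations for the backward pass.
h = x
hs, zs = [h], []
for W in Ws:
    z = W @ h
    h = sigmoid(z)
    zs.append(z)
    hs.append(h)

# Toy loss: squared error on the mean of the final hidden layer.
loss = 0.5 * (h.mean() - target) ** 2

# Backward pass: propagate dL/dh layer by layer and record the norm of the
# gradient with respect to each layer's weight matrix.
grad_h = np.full(width, (h.mean() - target) / width)
for layer in reversed(range(depth)):
    s = sigmoid(zs[layer])
    grad_z = grad_h * s * (1.0 - s)                # sigmoid derivative
    grad_W = np.outer(grad_z, hs[layer])           # dL/dW for this layer
    grad_h = Ws[layer].T @ grad_z                  # propagate to previous layer
    print(f"layer {layer:2d}: ||dL/dW|| = {np.linalg.norm(grad_W):.3e}")
# The printed norms decay as backpropagation moves towards the input layer,
# because each step multiplies by a Jacobian whose sigmoid factors are at most 0.25.
```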