© 2017 IEEE. Training deep neural networks is difficult due to the pathological curvature problem. Re-parameterization is an effective way to alleviate this problem, either by approximately learning the curvature or by constraining the weights to have properties favorable for optimization. This paper proposes to reparameterize the input weight of each neuron in deep neural networks by normalizing it to zero mean and unit norm, followed by a learnable scalar parameter that adjusts the norm of the weight. This technique implicitly stabilizes the weight distribution. Moreover, it improves the conditioning of the optimization problem and thus accelerates the training of deep neural networks. It can be wrapped as a linear module in practice and plugged i...
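The reparameterization this abstract describes can be sketched as follows. This is a minimal illustrative NumPy version, not the paper's implementation: the raw weight vector is centered to zero mean, rescaled to unit norm, and multiplied by a learnable scalar; the names `v` and `g` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def reparameterize_weight(v, g):
    """Sketch of the zero-mean, unit-norm weight reparameterization:
    center the raw weight vector v, rescale to unit norm, then scale
    by a learnable scalar g that restores a trainable norm."""
    v_centered = v - v.mean()
    return g * v_centered / np.linalg.norm(v_centered)

# Example: one neuron's raw input weight and its scale parameter.
v = np.array([1.0, 2.0, 3.0, 4.0])
g = 2.0
w = reparameterize_weight(v, g)
# The effective weight w has zero mean and norm equal to g.
```

In training, gradients would flow to both `v` and `g`, so the norm of the effective weight is decoupled from its direction, which is what the abstract credits for the improved conditioning.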
Orthogonal matrices have shown advantages in training Recurrent Neural Networks (RNNs), but such matrix...
We introduce canonical weight normalization for convolutional neural networks. Inspired by the canon...
Recent years have witnessed the success of deep neural networks in dealing with plenty of practica...
We present weight normalization: a reparameterization of the weight vectors in a neural net...
The highly non-linear nature of deep neural networks causes them to be susceptible to adversarial ex...
Modern neural networks are over-parametrized. In particular, each rectified li...
The success of deep neural networks is in part due to the use of normalization layers. Normalization...
This paper proposes a set of new error criteria and a learning approach, called Adaptive Normalized ...
Optimization is the key component of deep learning. Increasing depth, which is vital for reaching a...
In the recent decade, deep neural networks have solved ever more complex tasks across many fronts in...
A new method of initializing the weights in deep neural networks is proposed. The method follows two...
How to train deep neural networks (DNNs) to generalize well is a central concern in deep learning, e...
Batch Normalization (BN) is an essential component of the Deep Neural Networks (DNNs) architectures....
It is a central problem in both statistics and computer science to understand the theoretical founda...
© Copyright 2016, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rig...