The recent success of large and deep neural network models has motivated the training of even larger and deeper networks with millions of parameters. Training these models usually requires parallel training methods, where communicating the large number of parameters becomes one of the main bottlenecks. We show that many deep learning models are over-parameterized and that their learned features can be predicted given only a small fraction of their parameters. We then propose a method that exploits this fact during training to reduce the number of parameters that need to be learned. Our method is orthogonal to the choice of network architecture and can be applied to a wide variety of neural network architectures and application areas. We evaluate...
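The abstract above claims that learned features can be predicted from a small fraction of their parameters. The sketch below illustrates the underlying intuition only: it treats one feature (a column of a weight matrix over an image grid) as a smooth function of pixel coordinates, observes a random 25% of its entries, and predicts the rest with kernel ridge regression under an RBF smoothness prior. This is a toy reconstruction, not the paper's exact training procedure, and all constants (grid size, kernel width, ridge term, observation rate) are illustrative assumptions.

import numpy as np

def rbf_kernel(A, B, length_scale=2.0):
    # Squared-exponential kernel between two sets of pixel coordinates.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2.0 * length_scale ** 2))

def predict_feature(weights, observed_idx, coords, ridge=1e-3):
    # Kernel ridge regression: fit on the observed weight values, then
    # interpolate the feature at every pixel coordinate.
    K = rbf_kernel(coords[observed_idx], coords[observed_idx])
    alpha = np.linalg.solve(K + ridge * np.eye(len(observed_idx)),
                            weights[observed_idx])
    return rbf_kernel(coords, coords[observed_idx]) @ alpha

# Toy demo: a smooth 16x16 "feature", as one column of a weight matrix
# connected to a 16x16 input image might look after training.
side = 16
xs, ys = np.meshgrid(np.arange(side), np.arange(side), indexing="ij")
coords = np.stack([xs.ravel(), ys.ravel()], axis=1).astype(float)
true_w = np.sin(coords[:, 0] / 3.0) * np.cos(coords[:, 1] / 4.0)

rng = np.random.default_rng(0)
observed = rng.choice(side * side, size=side * side // 4, replace=False)
pred_w = predict_feature(true_w, observed, coords)
print("relative error:", np.linalg.norm(pred_w - true_w) / np.linalg.norm(true_w))

In the setting the abstract describes, the point of such a predictor is that only the observed subset of weights needs to be learned and communicated during parallel training, while the remaining values are generated from them; the toy above only shows why a spatially smooth feature is recoverable from a quarter of its values.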
Recent work on neural networks with probabilistic parameters has shown that parameter uncertainty im...
Deep neural networks have achieved state-of-the-art performance in many artificial intelligence area...
While Deep Neural Networks (DNNs) have achieved tremendous success for large vocabulary continuous ...
We demonstrate that there is significant redundancy in the parameterization of several deep learning...
In my thesis I explored several techniques to improve how to efficiently model signal representation...
Deep learning has attracted tremendous attention from researchers in various fields of information e...
Deep neural networks are highly expressive models that have recently achieved state of the art perfo...
Deep learning has achieved great performance in various areas, such as computer vision, natural lang...
The remarkable practical success of deep learning has revealed some major surprises from a theoretic...
When a large feedforward neural network is trained on a small training set, it typically performs po...
In recent years the performance of deep learning algorithms has been demonstrated in a variety of a...
Deep neural networks with a large number of paramet...
Deep learning neural networks, or, more precisely, Convolutional Neural Networks (CNNs), have demons...
Thesis (Ph.D.), University of Washington, 2016-06. The choice of feature representation can have a lar...