Recently, it was shown that deep neural networks perform very well if the activities of hidden units are regularized during learning, e.g., by randomly dropping out 50% of their activities. We describe a method called "standout" in which a binary belief network is overlaid on a neural network and is used to regularize its hidden units by selectively setting activities to zero. This "adaptive dropout network" can be trained jointly with the neural network by approximately computing local expectations of binary dropout variables and computing derivatives using back-propagation. Interestingly, experiments suggest that a good dropout network regularizes activities according to magnitude. When evaluated on the MNIST and NORB datasets, we found...
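Since the abstract above only sketches the mechanism, the following is a minimal NumPy illustration of the adaptive-dropout ("standout") idea it describes. The function name, the ReLU nonlinearity, and the choice of tying the overlay network to the layer's own weights through a scale and shift (alpha, beta) are assumptions of this sketch, not the paper's reference implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def standout_layer(a, W, b, alpha=1.0, beta=0.0, train=True, rng=None):
    """One hidden layer with adaptive ("standout") dropout.

    The keep probability of each hidden unit comes from an overlaid
    network; as an assumption of this sketch, that network's weights are
    a scaled and shifted copy (alpha, beta) of the layer's own weights.
    """
    z = a @ W + b                       # pre-activation of the hidden layer
    h = np.maximum(z, 0.0)              # ReLU activity (assumed nonlinearity)
    p_keep = sigmoid(alpha * z + beta)  # adaptive keep probability per unit
    if train:
        rng = rng or np.random.default_rng()
        m = rng.random(p_keep.shape) < p_keep  # sample binary dropout mask
        return h * m
    # at test time, replace the sampled mask by its local expectation
    return h * p_keep
```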
The undeniable computational power of artificial neural networks has granted the scientific communit...
Regularization is essential when training large neural networks. As deep neural networks can be math...
Deep neural nets with a large number of parameters are very powerful machine learning systems. Howev...
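As a concrete illustration of the random unit-dropping described above, here is a minimal sketch of a standard dropout layer in NumPy. The inverted rescaling by 1/(1 - p), which removes the need for test-time scaling, is a common implementation convention assumed here rather than something stated in the truncated abstract.

```python
import numpy as np

def dropout(h, p_drop=0.5, train=True, rng=None):
    """Standard (inverted) dropout applied to a layer's activities h.

    During training, each unit is zeroed independently with probability
    p_drop and the survivors are rescaled by 1/(1 - p_drop), so the
    layer needs no rescaling at test time.
    """
    if not train:
        return h
    rng = rng or np.random.default_rng()
    mask = rng.random(h.shape) >= p_drop  # keep each unit with prob 1 - p_drop
    return h * mask / (1.0 - p_drop)
```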
Dropout is one of the most popular regularization methods used in deep learning. The general form of...
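The abstract above is cut off, but the multiplicative Bernoulli-mask formulation that "the general form" of dropout usually refers to is standard and can be stated as follows (supplied here as a standard statement, not recovered from the truncated text):

\[
\mathbf{h} = \mathbf{m} \odot f(W\mathbf{x} + \mathbf{b}), \qquad m_j \sim \mathrm{Bernoulli}(1-p),
\]

where \(f\) is the layer nonlinearity and \(p\) the drop rate; at test time the sampled mask is replaced by its expectation, \( \mathbf{h}_{\text{test}} = (1-p)\, f(W\mathbf{x} + \mathbf{b}) \).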
Dropout regularization has been widely used in various deep neural networks to combat overfitting. I...
Dropout has been proven to be an effective algorithm for training robust deep networks ...
Recently it has been shown that when training neural networks on a limited amount of data, randomly ...
As universal function approximators, neural networks have been successfully used for nonlinear dynam...
Dropout is a recently introduced algorithm for training neural networks by randomly dropping units d...
Recent years have witnessed the success of deep neural networks in dealing with a variety of practica...