We examine the zero-temperature Metropolis Monte Carlo (MC) algorithm as a tool for training a neural network by minimizing a loss function. We find that, as expected on theoretical grounds and shown empirically by other authors, Metropolis MC can train a neural net with an accuracy comparable to that of gradient descent (GD), if not necessarily as quickly. The Metropolis algorithm does not fail automatically when the number of parameters of a neural network is large. It can fail when a neural network's structure or neuron activations are strongly heterogeneous, and we introduce an adaptive Monte Carlo algorithm (aMC) to overcome these limitations. The intrinsic stochasticity and numerical stability of the MC method allow aMC to train deep n...
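The zero-temperature rule described above amounts to accepting a proposed weight perturbation only when the loss does not increase. Below is a minimal sketch under that reading; the one-hidden-layer regression net, the names net_loss and sigma, and all sizes are illustrative assumptions, not the paper's setup.

import numpy as np

rng = np.random.default_rng(0)

def net_loss(w, X, y):
    # one hidden layer of 8 tanh units; w packs both weight matrices
    W1 = w[:X.shape[1] * 8].reshape(X.shape[1], 8)
    W2 = w[X.shape[1] * 8:].reshape(8, 1)
    pred = np.tanh(X @ W1) @ W2
    return np.mean((pred.ravel() - y) ** 2)

X = rng.normal(size=(64, 4))
y = np.sin(X.sum(axis=1))
w = rng.normal(scale=0.1, size=4 * 8 + 8)
sigma = 0.02  # proposal step size

loss = net_loss(w, X, y)
for step in range(20000):
    trial = w + rng.normal(scale=sigma, size=w.size)  # Gaussian weight perturbation
    trial_loss = net_loss(trial, X, y)
    if trial_loss <= loss:  # zero temperature: accept only if the loss does not increase
        w, loss = trial, trial_loss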
We study a class of adaptive Markov Chain Monte Carlo (MCMC) processes which aim at behaving as an “...
The thesis research involves the application of machine learning (ML) to various parts of a Monte Ca...
Many connectionist learning algorithms consist of minimizing a cost of the form C(w) = E(J(z; w)) ...
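A hedged sketch of how a cost of this form is commonly minimized in practice: draw samples z and take per-sample stochastic gradient steps, so the expectation is approximated online. The quadratic J and the learning rate eta below are illustrative choices, not the cost from the abstract.

import numpy as np

rng = np.random.default_rng(1)

def grad_J(z, w):
    # gradient of J(z; w) = 0.5 * ||w - z||^2 with respect to w
    return w - z

w = np.zeros(3)
eta = 0.05  # learning rate
for t in range(5000):
    z = rng.normal(loc=1.0, size=3)  # sample z from the data distribution
    w -= eta * grad_J(z, w)          # per-sample update approximates the expectation
# w ends up near E[z] = (1, 1, 1), the minimizer of C(w)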
We show how a feed-forward neural network can be successfully trained by using a simulated annealing ...
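Where simulated annealing differs from the zero-temperature sketch above is the finite, decreasing temperature: uphill moves are accepted with probability exp(-d/T). The geometric cooling schedule and the quadratic test loss below are illustrative assumptions, not the scheme used by the abstract's authors.

import numpy as np

rng = np.random.default_rng(2)

def anneal(w, loss_fn, T0=1.0, cooling=0.999, sigma=0.05, steps=20000):
    loss, T = loss_fn(w), T0
    for _ in range(steps):
        trial = w + rng.normal(scale=sigma, size=w.size)
        d = loss_fn(trial) - loss
        # Metropolis rule at temperature T: uphill moves accepted with prob exp(-d/T)
        if d <= 0 or rng.random() < np.exp(-d / T):
            w, loss = trial, loss + d
        T *= cooling  # cool toward the zero-temperature limit
    return w, loss

w_final, loss_final = anneal(rng.normal(size=5), lambda v: float(np.sum(v ** 2)))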
Training a neural network is a difficult optimization problem because of numerous local minima. M...
We propose a novel strategy for training neural networks using sequential Monte Carlo algorithms. Th...
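Read as a particle method over network weights, a sequential Monte Carlo trainer maintains a population of candidate weight vectors that is reweighted by fitness, resampled, and jittered. The sketch below is one minimal instance under that reading; the stand-in loss and all tuning constants are assumptions.

import numpy as np

rng = np.random.default_rng(3)

def loss(w):
    return np.sum((w - 2.0) ** 2, axis=-1)  # stand-in for a training loss

n, dim = 200, 4
particles = rng.normal(size=(n, dim))  # population of candidate weight vectors
for t in range(100):
    logw = -loss(particles)                   # fitter particles get larger weights
    probs = np.exp(logw - logw.max())
    probs /= probs.sum()
    idx = rng.choice(n, size=n, p=probs)      # multinomial resampling
    particles = particles[idx] + rng.normal(scale=0.05, size=(n, dim))  # jitter / move step
best = particles[np.argmin(loss(particles))]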
Conventional training methods for neural networks involve starting at a random location in the solut...
In this thesis, we study the sequential Monte Carlo method for training neural networks in the conte...
Using the two-dimensional Ising model as an example, we show that in algorithms of the Markov Chain Monte Car...
Background: Markov chain Monte Carlo (MCMC) methods for deep learning are not commonly used because ...
We discuss a novel strategy for training neural networks using sequential Monte Carlo algorithms and...
We use the Monte Carlo Adaptation learning algorithm to design feed-back neural networks with discre...
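For networks with discrete weights, a Monte-Carlo-style adaptation step can be as simple as flipping one randomly chosen weight and keeping the flip only if the error does not grow. A minimal sketch with binary (+/-1) weights and a perceptron-like error, both illustrative assumptions rather than the cited algorithm.

import numpy as np

rng = np.random.default_rng(4)

X = rng.choice([-1, 1], size=(32, 11))
y = np.sign(X @ rng.choice([-1, 1], size=11))  # labels from a random teacher

def errors(w):
    return int(np.sum(np.sign(X @ w) != y))

w = rng.choice([-1, 1], size=11)
e = errors(w)
for step in range(5000):
    i = rng.integers(11)
    w[i] *= -1              # propose flipping one discrete weight
    e_new = errors(w)
    if e_new <= e:
        e = e_new           # accept the flip
    else:
        w[i] *= -1          # reject: undo the flip
    if e == 0:
        break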
This work introduces an alternative algorithm, simulated annealing, to minimize the prediction error...