Abstract We examine the zero-temperature Metropolis Monte Carlo (MC) algorithm as a tool for training a neural network by minimizing a loss function. We find that, as expected on theoretical grounds and shown empirically by other authors, Metropolis MC can train a neural net with an accuracy comparable to that of gradient descent (GD), if not necessarily as quickly. The Metropolis algorithm does not fail automatically when the number of parameters of a neural network is large. It can fail when a neural network’s structure or neuron activations are strongly heterogenous, and we introduce an adaptive Monte Carlo algorithm (aMC) to overcome these limitations. The intrinsic stochasticity and numerical stability of the MC method a...
In this thesis, we study the sequential Monte Carlo method for training neural networks in the conte...
We use the Monte Carlo Adaptation learning algorithm to design feed-back neural networks with discre...
We study a class of adaptive Markov Chain Monte Carlo (MCMC) processes which aim at behaving as an “...
We examine the zero-temperature Metropolis Monte Carlo (MC) algorithm as a tool for training a neura...
We show how a feed-forward neural network can be sucessfully trained by using a simulated annealing ...
Training a neural network is a difficult optimization problem because of numerous local minimums. M...
Abstract – Training a neural network is a difficult optimization problem because of numerous local m...
We propose a novel strategy for training neural networks using sequential Monte Carlo algorithms. Th...
Na przykładzie dwuwymiarowego modelu Isinga pokazujemy, że w algorytmach typu Markov Chain Monte Car...
Conventional training methods for neural networks involve starting al a random location in the solut...
Learning probability distributions on the weights of neural networks has recently proven beneficial ...
We introduce a gradient-based learning method to automatically adapt Markov chain Monte Carlo (MCMC)...
We discuss a novel strategy for training neural networks using sequential Monte Carlo algorithms and...
Random cost simulations were introduced as a method to investigate optimization prob-lems in systems...
Background: Markov chain Monte Carlo (MCMC) methods for deep learning are not commonly used because ...
In this thesis, we study the sequential Monte Carlo method for training neural networks in the conte...
We use the Monte Carlo Adaptation learning algorithm to design feed-back neural networks with discre...
We study a class of adaptive Markov Chain Monte Carlo (MCMC) processes which aim at behaving as an “...
We examine the zero-temperature Metropolis Monte Carlo (MC) algorithm as a tool for training a neura...
We show how a feed-forward neural network can be sucessfully trained by using a simulated annealing ...
Training a neural network is a difficult optimization problem because of numerous local minimums. M...
Abstract – Training a neural network is a difficult optimization problem because of numerous local m...
We propose a novel strategy for training neural networks using sequential Monte Carlo algorithms. Th...
Na przykładzie dwuwymiarowego modelu Isinga pokazujemy, że w algorytmach typu Markov Chain Monte Car...
Conventional training methods for neural networks involve starting al a random location in the solut...
Learning probability distributions on the weights of neural networks has recently proven beneficial ...
We introduce a gradient-based learning method to automatically adapt Markov chain Monte Carlo (MCMC)...
We discuss a novel strategy for training neural networks using sequential Monte Carlo algorithms and...
Random cost simulations were introduced as a method to investigate optimization prob-lems in systems...
Background: Markov chain Monte Carlo (MCMC) methods for deep learning are not commonly used because ...
In this thesis, we study the sequential Monte Carlo method for training neural networks in the conte...
We use the Monte Carlo Adaptation learning algorithm to design feed-back neural networks with discre...
We study a class of adaptive Markov Chain Monte Carlo (MCMC) processes which aim at behaving as an “...