We introduce a novel training principle for probabilistic models that is an alternative to maximum likelihood. The proposed Generative Stochastic Networks (GSN) framework is based on learning the transition operator of a Markov chain whose stationary distribution estimates the data distribution. The transition distribution of the Markov chain is conditional on the previous state, generally involving a small move, so this conditional distribution has fewer dominant modes, being unimodal in the limit of small moves. Thus, it is easier to learn because it is easier to approximate its partition function, more like learning to perform supervised function approximation, with gradients that can be obtained by backprop. We provide theorems that ge...
Despite remarkable progress in deep learning, its hardware implementation beyond deep learning ac...
Multilayer perceptrons (MLPs) or artificial neural nets are popular models used for non-linear regre...
Understanding the impact of data structure on the computational tractability of learning is a key ch...
We introduce a novel training principle for probabilistic models that is an alternative to maximum ...
We introduce a novel training principle for probabilistic models that is an alternative to maximum...
We introduce a novel training principle for generative probabilistic models that is an alternative ...
Deep Gaussian processes (DGPs) are multi-layer hierarchical generalisations of Gaussian processes (G...
We marry ideas from deep neural networks and approximate Bayesian inference to derive a generalised ...
We marry ideas from deep neural networks and approximate Bayesian inference to derive a generalised...
Generative Stochastic Networks (GSNs) have been recently introduced as an alternative to traditiona...
This is the final version of the article. It first appeared from International Conference on Learnin...
Stochastic binary hidden units in a multi-layer perceptron (MLP) network give at least three potenti...
Here, we introduce a new family of probabilistic models called Rectified Gaussian Nets, or RGNs. RGN...