The field of statistical machine learning has seen a rapid progress in complex hierarchical Bayesian models. In Stochastic Variational Inference (SVI), the in-ference problem is mapped to an optimization problem involving stochastic gra-dients. While this scheme was shown to scale up to massive data sets, the intrinsic noise of the stochastic gradients impedes a fast convergence. Inspired by gradient averaging methods from stochastic optimization, we propose a variance reduction scheme tailored to SVI by averaging successively over the sufficient statistics of the local variational parameters. Its simplicity comes at the cost of biased stochas-tic gradients. We show that we can eliminate large parts of the bias while obtaining the same vari...
International audienceStochastic Gradient Descent (SGD) is a workhorse in machine learning, yet its ...
Stochastic variational inference is a promising method for fitting large-scale probabilistic models ...
This work considers optimization methods for large-scale machine learning (ML). Optimization in ML ...
Stochastic gradient optimization is a class of widely used algorithms for training machine learning ...
<p>Stochastic gradient optimization is a class of widely used algorithms for training machine learni...
Stochastic variational inference (SVI) uses stochastic optimization to scale up Bayesian computation...
Stochastic gradient descent is popular for large scale optimization but has slow convergence asympto...
With the purpose of examining biased updates in variance-reduced stochastic gradient methods, we int...
Abstract Stochastic variational inference makes it possible to approximate posterior distributions i...
Stochastic variational inference finds good posterior approximations of probabilistic models with ve...
Stochastic gradient descent is popular for large scale optimization but has slow convergence asympto...
<p>Stochastic variational inference finds good posterior approximations of probabilistic models with...
Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparame...
Variational inference approximates the posterior distribution of a probabilistic model with a parame...
Stochastic variational inference makes it possible to approximate posterior distributions induced by...
International audienceStochastic Gradient Descent (SGD) is a workhorse in machine learning, yet its ...
Stochastic variational inference is a promising method for fitting large-scale probabilistic models ...
This work considers optimization methods for large-scale machine learning (ML). Optimization in ML ...
Stochastic gradient optimization is a class of widely used algorithms for training machine learning ...
<p>Stochastic gradient optimization is a class of widely used algorithms for training machine learni...
Stochastic variational inference (SVI) uses stochastic optimization to scale up Bayesian computation...
Stochastic gradient descent is popular for large scale optimization but has slow convergence asympto...
With the purpose of examining biased updates in variance-reduced stochastic gradient methods, we int...
Abstract Stochastic variational inference makes it possible to approximate posterior distributions i...
Stochastic variational inference finds good posterior approximations of probabilistic models with ve...
Stochastic gradient descent is popular for large scale optimization but has slow convergence asympto...
<p>Stochastic variational inference finds good posterior approximations of probabilistic models with...
Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparame...
Variational inference approximates the posterior distribution of a probabilistic model with a parame...
Stochastic variational inference makes it possible to approximate posterior distributions induced by...
International audienceStochastic Gradient Descent (SGD) is a workhorse in machine learning, yet its ...
Stochastic variational inference is a promising method for fitting large-scale probabilistic models ...
This work considers optimization methods for large-scale machine learning (ML). Optimization in ML ...