Averaging scheme has attracted extensive attention in deep learning as well as traditional machine learning. It achieves theoretically optimal convergence and also improves the empirical model performance. However, there is still a lack of sufficient convergence analysis for strongly convex optimization. Typically, the convergence about the last iterate of gradient descent methods, which is referred to as individual convergence, fails to attain its optimality due to the existence of logarithmic factor. In order to remove this factor, we first develop gradient descent averaging (GDA), which is a general projection-based dual averaging algorithm in the strongly convex setting. We further present primal-dual averaging for strongly convex cases...
Free to read at publisher website We study accelerated descent dynamics for constrained convex optim...
With a weighting scheme proportional to t, a traditional stochastic gradient descent (SGD) algorithm...
Thesis: Ph. D. in Mathematics and Operations Research, Massachusetts Institute of Technology, Depart...
Stochastic gradient descent (SGD) is a simple and very popular iterative method to solve stochastic ...
Stochastic gradient descent (SGD) is a simple and popular method to solve stochastic optimization pr...
An usual problem in statistics consists in estimating the minimizer of a convex function. When we ha...
An usual problem in statistics consists in estimating the minimizer of a convex function. When we ha...
We describe a primal-dual framework for the design and analysis of online strongly convex optimizati...
International audienceWe introduce and analyze a new family of first-order optimization algorithms w...
We introduce and analyse a new family of algorithms which generalizes and unifies both the mirror de...
Stochastic Gradient Descent (SGD) is one of the simplest and most popular stochastic optimization me...
We introduce a new algorithm, extended regularized dual averaging (XRDA), for solving regularized st...
Consider the problem of minimizing functions that are Lipschitz and strongly convex, but not necessa...
The dissertation addresses the research topics of machine learning outlined below. We developed the ...
Abstract We propose a simple variant of the generalized Frank–Wolfe method for solving ...
Free to read at publisher website We study accelerated descent dynamics for constrained convex optim...
With a weighting scheme proportional to t, a traditional stochastic gradient descent (SGD) algorithm...
Thesis: Ph. D. in Mathematics and Operations Research, Massachusetts Institute of Technology, Depart...
Stochastic gradient descent (SGD) is a simple and very popular iterative method to solve stochastic ...
Stochastic gradient descent (SGD) is a simple and popular method to solve stochastic optimization pr...
An usual problem in statistics consists in estimating the minimizer of a convex function. When we ha...
An usual problem in statistics consists in estimating the minimizer of a convex function. When we ha...
We describe a primal-dual framework for the design and analysis of online strongly convex optimizati...
International audienceWe introduce and analyze a new family of first-order optimization algorithms w...
We introduce and analyse a new family of algorithms which generalizes and unifies both the mirror de...
Stochastic Gradient Descent (SGD) is one of the simplest and most popular stochastic optimization me...
We introduce a new algorithm, extended regularized dual averaging (XRDA), for solving regularized st...
Consider the problem of minimizing functions that are Lipschitz and strongly convex, but not necessa...
The dissertation addresses the research topics of machine learning outlined below. We developed the ...
Abstract We propose a simple variant of the generalized Frank–Wolfe method for solving ...
Free to read at publisher website We study accelerated descent dynamics for constrained convex optim...
With a weighting scheme proportional to t, a traditional stochastic gradient descent (SGD) algorithm...
Thesis: Ph. D. in Mathematics and Operations Research, Massachusetts Institute of Technology, Depart...