We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on the idea of predictable sequences. First, we recover the Mir-ror Prox algorithm for offline optimization, prove an extension to Hölder-smooth functions, and apply the results to saddle-point type problems. Next, we prove that a version of Optimistic Mirror Descent (which has a close relation to the Ex-ponential Weights algorithm) can be used by two strongly-uncoupled players in a finite zero-sum matrix game to converge to the minimax equilibrium at the rate of O((logT)T). This addresses a question of Daskalakis et al [6]. Further, we consider a partial information version of the problem. We then apply the results to convex programming and ex...
Lagrangian relaxation and approximate optimization algorithms have received much attention in the la...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
We study the close connections between game theory, on-line prediction and boosting. After a brief r...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We show a principled way of deriving online learning algorithms from a minimax analysis. Various upp...
We present methods for online linear optimization that take advantage of benign (as opposed to worst...
We present a simple unified analysis of adaptive Mirror Descent (MD) and Follow- the-Regularized-Lea...
A natural algorithmic scheme in online game playing is called ‘follow-the-leader’, first proposed by...
Online learning and convex optimization algorithms have become essential tools for solving problems ...
Lagrangian relaxation and approximate optimization algorithms have received much attention in the la...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
We study the close connections between game theory, on-line prediction and boosting. After a brief r...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We show a principled way of deriving online learning algorithms from a minimax analysis. Various upp...
We present methods for online linear optimization that take advantage of benign (as opposed to worst...
We present a simple unified analysis of adaptive Mirror Descent (MD) and Follow- the-Regularized-Lea...
A natural algorithmic scheme in online game playing is called ‘follow-the-leader’, first proposed by...
Online learning and convex optimization algorithms have become essential tools for solving problems ...
Lagrangian relaxation and approximate optimization algorithms have received much attention in the la...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
We study the close connections between game theory, on-line prediction and boosting. After a brief r...