We show a principled way of deriving online learning algorithms from a minimax analysis. Various upper bounds on the minimax value, previously thought to be non-constructive, are shown to yield algorithms. This allows us to seamlessly recover known methods and to derive new ones, also capturing such “unorthodox” methods as Follow the Perturbed Leader and the R2 forecaster. Understanding the inherent complexity of the learning problem thus leads to the development of algorithms. To illustrate our approach, we present several new algorithms, including a family of randomized methods that use the idea of a “random playout”. New versions of the Follow-the-Perturbed-Leader algorithms are presented, as well as methods based on the Littlestone’s di...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We show a principled way of deriving online learning algorithms from a minimax analysis. Various upp...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
We consider online learning when the time hori-zon is unknown. We apply a minimax analysis, beginnin...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
In online learning the performance of an algorithm is typically compared to the performance of a fix...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We present methods for online linear optimization that take advantage of benign (as opposed to worst...
Most online algorithms used in machine learning today are based on vari-ants of mirror descent or fo...
Thesis (Ph.D.)--University of Washington, 2020We present several novel results on computational prob...
Most traditional online learning algorithms are based on variants of mirror descent or follow-the-le...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We show a principled way of deriving online learning algorithms from a minimax analysis. Various upp...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
We consider online learning when the time hori-zon is unknown. We apply a minimax analysis, beginnin...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We provide several applications of Optimistic Mirror Descent, an online learning algorithm based on ...
In online learning the performance of an algorithm is typically compared to the performance of a fix...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We present methods for online linear optimization that take advantage of benign (as opposed to worst...
Most online algorithms used in machine learning today are based on vari-ants of mirror descent or fo...
Thesis (Ph.D.)--University of Washington, 2020We present several novel results on computational prob...
Most traditional online learning algorithms are based on variants of mirror descent or follow-the-le...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...