Relax and Randomize: From Value to Algorithms

Rakhlin, Alexander
Shamir, Ohad
Sridharan, Karthik

Publication date

January 2012

Publisher

ScholarlyCommons

Abstract

We show a principled way of deriving online learning algorithms from a minimax analysis. Various upper bounds on the minimax value, previously thought to be non-constructive, are shown to yield algorithms. This allows us to seamlessly recover known methods and to derive new ones, also capturing such “unorthodox” methods as Follow the Perturbed Leader and the R2 forecaster. Understanding the inherent complexity of the learning problem thus leads to the development of algorithms. To illustrate our approach, we present several new algorithms, including a family of randomized methods that use the idea of a “random playout”. New versions of the Follow-the-Perturbed-Leader algorithms are presented, as well as methods based on the Littlestone’s di...

