Abstract We study online aggregation of the predictions of experts, and first show new second-order regret bounds in the standard setting, which are obtained via a version of the Prod algorithm (and also a version of the polynomially weighted average algorithm) with multiple learning rates. These bounds are in terms of excess losses, the differences between the instantaneous losses suffered by the algorithm and the ones of a given expert. We then demonstrate the interest of these bounds in the context of experts that report their confidences as a number in the interval [0, 1] using a generic reduction to the standard setting. We conclude by two other applications in the standard setting, which improve the known bounds in case of small exces...
When applying aggregating strategies to Prediction with Expert Advice, the learning rate must be ada...
A number of online algorithms have been developed that have small additional loss (regret) compared ...
Consider the classical problem of predicting the next bit in a sequence of bits. A standard performa...
We study online aggregation of the predictions of experts, and first show new second-order regret bo...
We consider prediction with expert advice when the loss vectors are assumed to lie in a set describ...
This work studies external regret in sequential prediction games with both positive and negative pay...
We investigate on-line prediction of individual sequences. Given a class of predictors, the goal is...
We investigate on-line prediction of individual sequences. Given a class of predictors, the goal is ...
Performance guarantees for online learning algorithms typically take the form of regret bounds, whic...
Consider the following game: There is a fixed set V of n items. At each step an adversary chooses a ...
We study a general class of learning algorithms, which we call regret-matching algorithms, along wit...
Abstract. We study online learning algorithms that predict by com-bining the predictions of several ...
We take advantage of the correspondence between online learning algorithms design for negative regre...
In the framework of prediction with expert advice, we consider a recently introduced kind of regret ...
Abstract. For the prediction with expert advice setting, we consider methods to construct algorithms...
When applying aggregating strategies to Prediction with Expert Advice, the learning rate must be ada...
A number of online algorithms have been developed that have small additional loss (regret) compared ...
Consider the classical problem of predicting the next bit in a sequence of bits. A standard performa...
We study online aggregation of the predictions of experts, and first show new second-order regret bo...
We consider prediction with expert advice when the loss vectors are assumed to lie in a set describ...
This work studies external regret in sequential prediction games with both positive and negative pay...
We investigate on-line prediction of individual sequences. Given a class of predictors, the goal is...
We investigate on-line prediction of individual sequences. Given a class of predictors, the goal is ...
Performance guarantees for online learning algorithms typically take the form of regret bounds, whic...
Consider the following game: There is a fixed set V of n items. At each step an adversary chooses a ...
We study a general class of learning algorithms, which we call regret-matching algorithms, along wit...
Abstract. We study online learning algorithms that predict by com-bining the predictions of several ...
We take advantage of the correspondence between online learning algorithms design for negative regre...
In the framework of prediction with expert advice, we consider a recently introduced kind of regret ...
Abstract. For the prediction with expert advice setting, we consider methods to construct algorithms...
When applying aggregating strategies to Prediction with Expert Advice, the learning rate must be ada...
A number of online algorithms have been developed that have small additional loss (regret) compared ...
Consider the classical problem of predicting the next bit in a sequence of bits. A standard performa...