We consider the problem of online optimization, where a learner chooses a decision from a given decision set and suffers some loss associated with the decision and the state of the environment. The learner's objective is to minimize its cumulative regret against the best fixed decision in hindsight. Over the past few decades, numerous variants have been considered, with many algorithms designed to achieve sub-linear regret in the worst case. However, this level of robustness comes at a cost. Proposed algorithms are often over-conservative, failing to adapt to the actual complexity of the loss sequence, which is often far from the worst case. In this paper we introduce a general algorithm that, provided with a "safe" lear...
The regret bound of dynamic online learning algorithms is often expressed in terms of the variation ...
We address online linear optimization problems when the possible actions of the decision maker are r...
We provide a new online learning algorithm that for the first time combines several disparate notio...
We present methods for online linear optimization that take advantage of benign (as opposed to worst...
The framework of online learning with memory naturally captures learning problems with temporal effe...
Performance guarantees for online learning algorithms typically take the form of regret bounds, whic...
A number of online algorithms have been developed that have small additional loss (regret) compared ...
We study online combinatorial optimization problems where a learner is interes...
We consider a variation on the problem of prediction with expert advice, where...
Much of the work in online learning focuses on the study of sublinear upper bounds on the regret. In...