AbstractIn an online decision problem, one makes a sequence of decisions without knowledge of the future. Each period, one pays a cost based on the decision and observed state. We give a simple approach for doing nearly as well as the best single decision, where the best is chosen with the benefit of hindsight. A natural idea is to follow the leader, i.e. each period choose the decision which has done best so far. We show that by slightly perturbing the totals and then choosing the best decision, the expected performance is nearly as good as the best decision in hindsight. Our approach, which is very much like Hannan's original game-theoretic approach from the 1950s, yields guarantees competitive with the more modern exponential weighting a...
International audienceMany decision problems have two levels: one for strategic decisions, and anoth...
In online problems, the input forms a finite sequence of requests. Each request must be processed, i...
This paper proposes a novel algorithm for solving discrete online learning prob-lems under stochasti...
A natural algorithmic scheme in online game playing is called ‘follow-the-leader’, first proposed by...
In this research we study some online learning algorithms in the online convex optimization framewor...
Abstract We consider the problem of online optimization, where a learner chooses a decision from a g...
In this paper the sequential prediction problem with expert ad-vice is considered for the case where...
In this paper, we study a sequential decision making problem. The objective is to maximize the avera...
Aiding to make decisions as early as possible by learning from past experiences is becoming increasi...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
Abstract. An algorithm is presented for online prediction that allows to track the best expert effic...
We consider a classic social choice problem in an online setting. In each round, a decision maker o...
. We consider the problem of dynamically apportioning resources among a set of options in a worst-ca...
We address online linear optimization problems when the possible actions of the decision maker are r...
We investigate algorithms for different steps in the decision making process, focusing on systems wh...
International audienceMany decision problems have two levels: one for strategic decisions, and anoth...
In online problems, the input forms a finite sequence of requests. Each request must be processed, i...
This paper proposes a novel algorithm for solving discrete online learning prob-lems under stochasti...
A natural algorithmic scheme in online game playing is called ‘follow-the-leader’, first proposed by...
In this research we study some online learning algorithms in the online convex optimization framewor...
Abstract We consider the problem of online optimization, where a learner chooses a decision from a g...
In this paper the sequential prediction problem with expert ad-vice is considered for the case where...
In this paper, we study a sequential decision making problem. The objective is to maximize the avera...
Aiding to make decisions as early as possible by learning from past experiences is becoming increasi...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
Abstract. An algorithm is presented for online prediction that allows to track the best expert effic...
We consider a classic social choice problem in an online setting. In each round, a decision maker o...
. We consider the problem of dynamically apportioning resources among a set of options in a worst-ca...
We address online linear optimization problems when the possible actions of the decision maker are r...
We investigate algorithms for different steps in the decision making process, focusing on systems wh...
International audienceMany decision problems have two levels: one for strategic decisions, and anoth...
In online problems, the input forms a finite sequence of requests. Each request must be processed, i...
This paper proposes a novel algorithm for solving discrete online learning prob-lems under stochasti...