A stochastic combinatorial semi-bandit with a linear payoff is a sequential learning problem where at each step a learning agent chooses a subset of ground items subject to some combi-natorial constraints, then observes noisy weights of all chosen items, and finally receives their sum as a payoff. In this work, we close the problem of computationally and sample efficient learning in stochastic combinatorial semi-bandits. In par-ticular, we show that a relatively simple learn-ing algorithm, which is known to be computa-tionally efficient, also achieves near-optimal re-gret. We refer to this method as CombUCB1, and show that its n-step regret is O(KL(1/∆) log n) and O( KLn log n), where L is the number of ground items, K is the maximum number...
We consider the combinatorial bandits problem with semi-bandit feedback under finite sampling budget...
This thesis investigates sequential decision making tasks that fall in the framework of reinforcemen...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...
A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learn-ing...
A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learn-ing...
A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learn-ing...
International audienceWe improve the efficiency of algorithms for stochastic combinatorial semi-band...
International audienceThis paper investigates stochastic and adversarial combinatorial multi-armed b...
This paper investigates stochastic and adversarial combinatorial multi-armed bandit problems. In the...
The contextual combinatorial semi-bandit problem with linear payoff functions is a decision-making p...
In the classical stochastic k-armed bandit problem, in each of a sequence of rounds, a decision make...
The greedy algorithm is extensively studied in the field of combinatorial optimiza-tion for decades....
In this paper, we consider efficient learning in large-scale combinatorial semi-bandits with linear ...
International audienceWe consider combinatorial semi-bandits over a set X ⊂ {0, 1} d where rewards a...
International audienceWe consider combinatorial semi-bandits over a set of arms X ⊂ {0, 1} d where r...
We consider the combinatorial bandits problem with semi-bandit feedback under finite sampling budget...
This thesis investigates sequential decision making tasks that fall in the framework of reinforcemen...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...
A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learn-ing...
A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learn-ing...
A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learn-ing...
International audienceWe improve the efficiency of algorithms for stochastic combinatorial semi-band...
International audienceThis paper investigates stochastic and adversarial combinatorial multi-armed b...
This paper investigates stochastic and adversarial combinatorial multi-armed bandit problems. In the...
The contextual combinatorial semi-bandit problem with linear payoff functions is a decision-making p...
In the classical stochastic k-armed bandit problem, in each of a sequence of rounds, a decision make...
The greedy algorithm is extensively studied in the field of combinatorial optimiza-tion for decades....
In this paper, we consider efficient learning in large-scale combinatorial semi-bandits with linear ...
International audienceWe consider combinatorial semi-bandits over a set X ⊂ {0, 1} d where rewards a...
International audienceWe consider combinatorial semi-bandits over a set of arms X ⊂ {0, 1} d where r...
We consider the combinatorial bandits problem with semi-bandit feedback under finite sampling budget...
This thesis investigates sequential decision making tasks that fall in the framework of reinforcemen...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...