A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learning agent chooses a subset of ground items subject to constraints, and then observes stochastic weights of these items and receives their sum as a payoff. In this paper, we close the problem of computationally and sample efficient learning in stochastic combinatorial semi-bandits. In particular, we analyze a UCB-like algorithm for solving the problem, which is known to be computationally efficient; and prove O(KL(1/∆) log n) and O(√(KLn log n)) upper bounds on its n-step regret, where L is the number of ground items, K is the maximum number of chosen items, and ∆ is the gap between the expected returns of the optimal and best suboptimal soluti...
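As a rough illustration of the setting described in the abstract above, the following is a minimal sketch of a UCB-like learner, not the authors' implementation. It assumes the simplest constraint (choose any K of the L ground items), Bernoulli item weights, and a confidence radius of sqrt(1.5 log t / T_i); the function name `comb_ucb_top_k` and all parameter names are hypothetical.

```python
import math
import random


def comb_ucb_top_k(means, K, n, seed=0):
    """Sketch of a UCB-like semi-bandit learner for a 'choose K of L items' constraint.

    means: assumed expected weights of the L ground items (Bernoulli, for illustration).
    Returns the cumulative expected regret after n steps and the per-item observation counts.
    """
    rng = random.Random(seed)
    L = len(means)
    counts = [0] * L   # number of times each item's weight was observed
    sums = [0.0] * L   # running sum of observed weights per item
    best = sum(sorted(means, reverse=True)[:K])  # expected payoff of the optimal solution
    regret = 0.0
    for t in range(1, n + 1):
        # Upper confidence bound per item; unobserved items get an infinite
        # index so that every item is tried at least once.
        ucb = [
            sums[i] / counts[i] + math.sqrt(1.5 * math.log(t) / counts[i])
            if counts[i] > 0 else float("inf")
            for i in range(L)
        ]
        # With this constraint the optimization step reduces to taking the
        # K items with the largest indices.
        chosen = sorted(range(L), key=lambda i: ucb[i], reverse=True)[:K]
        # Semi-bandit feedback: the weight of every chosen item is observed.
        for i in chosen:
            w = 1.0 if rng.random() < means[i] else 0.0
            counts[i] += 1
            sums[i] += w
        regret += best - sum(means[i] for i in chosen)
    return regret, counts
```

For example, `comb_ucb_top_k([0.9, 0.8, 0.3, 0.2, 0.2, 0.1], K=2, n=2000)` returns the accumulated expected regret together with how often each item was chosen. Richer constraints (paths, matchings, matroids) would replace the top-K selection with a call to the corresponding combinatorial optimizer.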
We consider combinatorial semi-bandits over a set of arms X ⊂ {0, 1}^d where r...
We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitz function...
We study the problem of combinatorial pure exploration in the stochastic multi-armed bandit problem....
A stochastic combinatorial semi-bandit is an on-line learning problem where at each step a learning...
A stochastic combinatorial semi-bandit with a linear payoff is a sequential learning problem where a...
This paper investigates stochastic and adversarial combinatorial multi-armed bandit problems. In the...
We improve the efficiency of algorithms for stochastic combinatorial semi-band...
The greedy algorithm has been extensively studied in the field of combinatorial optimization for decades....
The contextual combinatorial semi-bandit problem with linear payoff functions is a decision-making p...
In this paper, we consider efficient learning in large-scale combinatorial semi-bandits with linear ...
In the classical stochastic k-armed bandit problem, in each of a sequence of rounds, a decision make...
We consider combinatorial semi-bandits over a set X ⊂ {0, 1}^d where rewards a...
This paper introduces and addresses a wide class of stochastic bandit problems...