We consider a linear stochastic bandit problem where the dimension $K$ of the unknown parameter $\theta$ is larger than the sampling budget $n$. In such cases, it is in general impossible to derive sub-linear regret bounds since usual linear bandit algorithms have a regret in $O(K\sqrt{n})$. In this paper we assume that $\theta$ is $S-$sparse, i.e.~has at most $S-$non-zero components, and that the space of arms is the unit ball for the $||.||_2$ norm. We combine ideas from Compressed Sensing and Bandit Theory and derive algorithms with regret bounds in $O(S\sqrt{n})$
International audienceThis work addresses the problem of regret minimization in non-stochastic multi...
International audienceWe consider a generalization of stochastic bandit problems where the set of ar...
We present a modification of the algorithm of Dani et al. [8] for the online linear optimization pro...
We consider a linear stochastic bandit problem where the dimension $K$ of the unknown parameter $\th...
We consider a linear stochastic bandit prob-lem where the dimension K of the unknown parameter is l...
International audienceIn the classical multi-armed bandit problem, d arms are available to the decis...
International audienceIn the classical multi-armed bandit problem, d arms are available to the decis...
Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We fo...
In sparse linear bandits, a learning agent sequentially selects an action and receive reward feedbac...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...
We consider the problem of online learning in misspecified linear stochastic multi-armed bandit prob...
Regret minimisation in stochastic multi-armed bandits is a well-studied problem, for which several o...
We study stochastic linear payoff bandit prob-lems and give a simple, computationally ef-ficient alg...
Stochastic high dimensional bandit problems with low dimensional structures are useful in different ...
International audienceWe consider a generalization of stochastic bandits where the set of arms, $\cX...
International audienceThis work addresses the problem of regret minimization in non-stochastic multi...
International audienceWe consider a generalization of stochastic bandit problems where the set of ar...
We present a modification of the algorithm of Dani et al. [8] for the online linear optimization pro...
We consider a linear stochastic bandit problem where the dimension $K$ of the unknown parameter $\th...
We consider a linear stochastic bandit prob-lem where the dimension K of the unknown parameter is l...
International audienceIn the classical multi-armed bandit problem, d arms are available to the decis...
International audienceIn the classical multi-armed bandit problem, d arms are available to the decis...
Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We fo...
In sparse linear bandits, a learning agent sequentially selects an action and receive reward feedbac...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...
We consider the problem of online learning in misspecified linear stochastic multi-armed bandit prob...
Regret minimisation in stochastic multi-armed bandits is a well-studied problem, for which several o...
We study stochastic linear payoff bandit prob-lems and give a simple, computationally ef-ficient alg...
Stochastic high dimensional bandit problems with low dimensional structures are useful in different ...
International audienceWe consider a generalization of stochastic bandits where the set of arms, $\cX...
International audienceThis work addresses the problem of regret minimization in non-stochastic multi...
International audienceWe consider a generalization of stochastic bandit problems where the set of ar...
We present a modification of the algorithm of Dani et al. [8] for the online linear optimization pro...