International audienceWe propose the kl-UCB ++ algorithm for regret minimization in stochastic bandit models with exponential families of distributions. We prove that it is simultaneously asymptotically optimal (in the sense of Lai and Robbins' lower bound) and minimax optimal. This is the first algorithm proved to enjoy these two properties at the same time. This work thus merges two different lines of research with simple and clear proofs
We consider a stochastic bandit problem with in-finitely many arms. In this setting, the learner has...
International audienceIn the classical multi-armed bandit problem, d arms are available to the decis...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...
International audienceWe propose the kl-UCB ++ algorithm for regret minimization in stochastic bandi...
We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitz function...
International audienceWe consider the problem of regret minimization in non-parametric stochastic ba...
Cette thèse s'inscrit dans les domaines de l'apprentissage statistique et de la statistique séquenti...
International audienceWe consider $K$-–armed stochastic bandits and consider cumulative regret bound...
International audienceWe consider stochastic multi-armed bandit problems where the expected reward i...
We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitzfunction ...
The topics addressed in this thesis lie in statistical machine learning and sequential statistic. Ou...
We improve the theoretical analysis and empirical performance of algorithms for the stochastic multi...
We improve the theoretical analysis and empirical performance of algorithms for the stochastic multi...
We consider a stochastic bandit problem with in-finitely many arms. In this setting, the learner has...
International audienceIn the classical multi-armed bandit problem, d arms are available to the decis...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...
International audienceWe propose the kl-UCB ++ algorithm for regret minimization in stochastic bandi...
We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitz function...
International audienceWe consider the problem of regret minimization in non-parametric stochastic ba...
Cette thèse s'inscrit dans les domaines de l'apprentissage statistique et de la statistique séquenti...
International audienceWe consider $K$-–armed stochastic bandits and consider cumulative regret bound...
International audienceWe consider stochastic multi-armed bandit problems where the expected reward i...
We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitzfunction ...
The topics addressed in this thesis lie in statistical machine learning and sequential statistic. Ou...
We improve the theoretical analysis and empirical performance of algorithms for the stochastic multi...
We improve the theoretical analysis and empirical performance of algorithms for the stochastic multi...
We consider a stochastic bandit problem with in-finitely many arms. In this setting, the learner has...
International audienceIn the classical multi-armed bandit problem, d arms are available to the decis...
International audienceThis paper introduces and addresses a wide class of stochastic bandit problems...