The main topics adressed in this thesis lie in the general domain of sequential learning, and in particular stochastic multi-armed bandits.The thesis is divided into four chapters and an introduction. In the first part of the main body of the thesis, we design a new algorithm achieving, simultaneously, distribution-dependent and distribution-free optimal guarantees. The next two chapters are devoted to adaptivity questions. First, in the context of continuum-armed bandits, we present a new algorithm which, for the first time, does not require the knowledge of the regularity of the bandit problem it is facing. Then, we study the issue of adapting to the unknown support of the payoffs in bounded $K$-armed bandits. We provide a procedure that ...
We consider stochastic bandit problems with $K$ arms, each associated with a bounded distribution su...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
Cette thèse s'inscrit dans les domaines de l'apprentissage statistique et de la statistique séquenti...
This document presents in a unified way different results about the optimal solution of several mult...
This document presents in a unified way different results about the optimal solution of several mult...
A Multi-Armed Bandits (MAB) is a learning problem where an agent sequentially chooses an action amon...
This thesis studies the following topics in Machine Learning: Bandit theory, Statistical learning an...
This thesis studies the following topics in Machine Learning: Bandit theory, Statistical learning an...
Cette thèse traite des domaines suivant en Apprentissage Automatique: la théorie des Bandits, l'Appr...
Cette thèse traite des domaines suivant en Apprentissage Automatique: la théorie des Bandits, l'Appr...
International audienceIn the context of stochastic continuum-armed bandits, we present an algorithm ...
We consider stochastic bandit problems with $K$ arms, each associated with a bounded distribution su...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
Cette thèse s'inscrit dans les domaines de l'apprentissage statistique et de la statistique séquenti...
This document presents in a unified way different results about the optimal solution of several mult...
This document presents in a unified way different results about the optimal solution of several mult...
A Multi-Armed Bandits (MAB) is a learning problem where an agent sequentially chooses an action amon...
This thesis studies the following topics in Machine Learning: Bandit theory, Statistical learning an...
This thesis studies the following topics in Machine Learning: Bandit theory, Statistical learning an...
Cette thèse traite des domaines suivant en Apprentissage Automatique: la théorie des Bandits, l'Appr...
Cette thèse traite des domaines suivant en Apprentissage Automatique: la théorie des Bandits, l'Appr...
International audienceIn the context of stochastic continuum-armed bandits, we present an algorithm ...
We consider stochastic bandit problems with $K$ arms, each associated with a bounded distribution su...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...