We investigate the structural properties of certain sequential decision-making problems with limited feedback (bandits) in order to bring the known algorithmic solutions closer to a practical use. In the first part, we put a special emphasis on structures that can be represented as graphs on actions, in the second part we study the large action spaces that can be of exponential size in the number of base actions or even infinite. We show how to take advantage of structures over the actions and (provably) learn faster
International audienceWe consider stochastic sequential learning problems where the learner can obse...
Multi-armed bandit problems are receiving a great deal of attention because they adequately formaliz...
We study bandits with graph-structured feedback, where a learner repeatedly selects an arm and then ...
We investigate the structural properties of certain sequential decision-making problems with limited...
This thesis studies several extensions of multi-armed bandit problem, where a learner sequentially s...
Sequential decision making is a core component of many real-world applications, from computer-networ...
We introduce a rich class of graphical models for multi-armed bandit problems that permit both the s...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
Understanding the dynamics of complex systems, and how to optimally act in them impacts all aspects ...
Dans cette thèse nous étudions différentes généralisations du problème dit « du bandit manchot ». Le...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
Multi-armed bandit problems are receiving a great deal of attention because they adequately formaliz...
We study bandits with graph-structured feedback, where a learner repeatedly selects an arm and then ...
We investigate the structural properties of certain sequential decision-making problems with limited...
This thesis studies several extensions of multi-armed bandit problem, where a learner sequentially s...
Sequential decision making is a core component of many real-world applications, from computer-networ...
We introduce a rich class of graphical models for multi-armed bandit problems that permit both the s...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
Understanding the dynamics of complex systems, and how to optimally act in them impacts all aspects ...
Dans cette thèse nous étudions différentes généralisations du problème dit « du bandit manchot ». Le...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
Multi-armed bandit problems are receiving a great deal of attention because they adequately formaliz...
We study bandits with graph-structured feedback, where a learner repeatedly selects an arm and then ...