We investigate the structural properties of certain sequential decision-making problems with limited feedback (bandits) in order to bring the known algorithmic solutions closer to a practical use. In the first part, we put a special emphasis on structures that can be represented as graphs on actions, in the second part we study the large action spaces that can be of exponential size in the number of base actions or even infinite. We show how to take advantage of structures over the actions and (provably) learn faster
Dans cette thèse nous étudions différentes généralisations du problème dit « du bandit manchot ». Le...
International audienceThis paper investigates stochastic and adversarial combinatorial multi-armed b...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
We investigate the structural properties of certain sequential decision-making problems with limited...
This thesis studies several extensions of multi-armed bandit problem, where a learner sequentially s...
Sequential decision making is a core component of many real-world applications, from computer-networ...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We introduce a rich class of graphical models for multi-armed bandit problems that permit both the s...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Understanding the dynamics of complex systems, and how to optimally act in them impacts all aspects ...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
Multi-armed bandit problems are receiving a great deal of attention because they adequately formaliz...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
Dans cette thèse nous étudions différentes généralisations du problème dit « du bandit manchot ». Le...
International audienceThis paper investigates stochastic and adversarial combinatorial multi-armed b...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
We investigate the structural properties of certain sequential decision-making problems with limited...
This thesis studies several extensions of multi-armed bandit problem, where a learner sequentially s...
Sequential decision making is a core component of many real-world applications, from computer-networ...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We introduce a rich class of graphical models for multi-armed bandit problems that permit both the s...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Understanding the dynamics of complex systems, and how to optimally act in them impacts all aspects ...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
Multi-armed bandit problems are receiving a great deal of attention because they adequately formaliz...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
Dans cette thèse nous étudions différentes généralisations du problème dit « du bandit manchot ». Le...
International audienceThis paper investigates stochastic and adversarial combinatorial multi-armed b...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...