Many applications can be modeled as follows: an agent is given access to several distributions and she wishes to determine those that meet some pre-specified criteria by sampling from the distributions in a sequential experiment. For example internet companies often perform A/B/n testing, which consists of determining which of several website design options is the best (e.g., maximizes the probability of a purchase) by diverting traffic to each of the options. As another example, in crowdsourcing, it is important to identify high-quality workers from a large pool of workers, e.g., those who give the correct answer with the highest probability on a random question. It is common practice to use "gold standard" questions, i.e., questions whose...
We study the problem of selecting K arms with the highest expected rewards in a stochastic n-armed b...
There are numerous situations when a service requester wishes to expertsource a series of identical ...
We survey the literature on multi-armed bandit models and their applications in economics. The multi...
Many applications can be modeled as follows: an agent is given access to several distributions and s...
Consider a requester who wishes to crowdsource a series of identical binary labeling tasks from a po...
Very recently crowdsourcing has become the de facto platform for distributing and collecting hu-man ...
Multi-armed bandit, a popular framework for sequential decision-making problems, has recently gained...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
The multi-armed bandit (MAB) problem provides a convenient abstraction for many online decision prob...
In this work, we explore an online reinforcement learning problem called the multi-armed bandit for ...
We address the problem of online sequential decision making, i.e., balancing the trade-off between e...
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
Increasingly, organisations flexibly outsource work on a temporary basis to a global audience of wor...
Part 5: Machine LearningInternational audienceThe multi-armed bandit problem has been studied for de...
We address the expert crowdsourcing problem, in which an employer wishes to assign tasks to a set of...
We study the problem of selecting K arms with the highest expected rewards in a stochastic n-armed b...
There are numerous situations when a service requester wishes to expertsource a series of identical ...
We survey the literature on multi-armed bandit models and their applications in economics. The multi...
Many applications can be modeled as follows: an agent is given access to several distributions and s...
Consider a requester who wishes to crowdsource a series of identical binary labeling tasks from a po...
Very recently crowdsourcing has become the de facto platform for distributing and collecting hu-man ...
Multi-armed bandit, a popular framework for sequential decision-making problems, has recently gained...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
The multi-armed bandit (MAB) problem provides a convenient abstraction for many online decision prob...
In this work, we explore an online reinforcement learning problem called the multi-armed bandit for ...
We address the problem of online sequential decision making, i.e., balancing the trade-off between e...
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
Increasingly, organisations flexibly outsource work on a temporary basis to a global audience of wor...
Part 5: Machine LearningInternational audienceThe multi-armed bandit problem has been studied for de...
We address the expert crowdsourcing problem, in which an employer wishes to assign tasks to a set of...
We study the problem of selecting K arms with the highest expected rewards in a stochastic n-armed b...
There are numerous situations when a service requester wishes to expertsource a series of identical ...
We survey the literature on multi-armed bandit models and their applications in economics. The multi...