Sequential decision-making is an iterative process between a learning agent and an environment. We study the stochastic setting, where the learner chooses an action in each round and the environment returns a noisy feedback signal. The learner's objective is to maximize a reward function that depends on the chosen actions. This basic model has many applications, including adaptive experimental design, product recommendations, dynamic pricing and black-box optimization. The combination of statistical uncertainty and the objective to maximize reward creates a tension between exploration and exploitation: The learner has to carefully balance between actions that provide informative feedback and actions estimated to yield a high reward. The...
Abstract. We consider Bayesian information collection, in which a measurement policy collects inform...
The principles of statistical mechanics and information theory play an important role in learning an...
We are interested in the problem of utilizing collected data to inform and direct learning towards a...
We propose information-directed sampling – a new algorithm for online optimization prob-lems in whic...
Partial monitoring is a rich framework for sequential decision making under uncertainty that general...
Partial monitoring is an expressive framework for sequential decision-making with an abundance of ap...
Information-directed sampling (IDS) has revealed its potential as a data-efficient algorithm for rei...
We consider stochastic multi-armed bandit problems with graph feedback, where the decision maker is ...
Purpose Sampling an action according to the probability that the action is believed to be the optima...
Purpose Sampling an action according to the probability that the action is believed to be the optima...
In sequential decision problems in an unknown environment, the decision maker often faces a dilemma ...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
Abstract. We consider Bayesian information collection, in which a measurement policy collects inform...
The principles of statistical mechanics and information theory play an important role in learning an...
We are interested in the problem of utilizing collected data to inform and direct learning towards a...
We propose information-directed sampling – a new algorithm for online optimization prob-lems in whic...
Partial monitoring is a rich framework for sequential decision making under uncertainty that general...
Partial monitoring is an expressive framework for sequential decision-making with an abundance of ap...
Information-directed sampling (IDS) has revealed its potential as a data-efficient algorithm for rei...
We consider stochastic multi-armed bandit problems with graph feedback, where the decision maker is ...
Purpose Sampling an action according to the probability that the action is believed to be the optima...
Purpose Sampling an action according to the probability that the action is believed to be the optima...
In sequential decision problems in an unknown environment, the decision maker often faces a dilemma ...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
Abstract. We consider Bayesian information collection, in which a measurement policy collects inform...
The principles of statistical mechanics and information theory play an important role in learning an...
We are interested in the problem of utilizing collected data to inform and direct learning towards a...