The authors introduce the contextual multi-armed bandit task as a framework to investigate learning and decision making in uncertain environments. In this novel paradigm, participants repeatedly choose between multiple options in order to maximize their rewards. The options are described by a number of contextual features which are predictive of the rewards through initially unknown functions. From their experience with choosing options and observing the consequences of their decisions, participants can learn about the functional relation between contexts and rewards and improve their decision strategy over time. In three experiments, the authors explore participants’ behavior in such learning environments. They predict participants’ behavi...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Graduate School of Artificial Intelligence ArtificiMulti-armed bandit is a well-formulated test bed ...
This thesis consists of three studies investigating the strategy selection problem and the role of ...
The authors introduce the contextual multi-armed bandit task as a framework to investigate learning ...
In real-life decision environments people learn from their di-rect experience with alternative cours...
In repeated decision problems for which it is possible to learn from experience, people should activ...
The multi-armed bandit framework can be motivated by any problem where there is an abundance of choi...
Presented as part of the ARC11 lecture on October 30, 2017 at 10:00 a.m. in the Klaus Advanced Compu...
How do humans search for rewards? This question is commonly studied using multi-armed bandit tasks, ...
University of Technology Sydney. Faculty of Engineering and Information Technology.The sequential de...
How do people decide whether to try out novel options as opposed to tried-and-tested ones? We argue ...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
How do people decide whether to try out novel options as opposed to tried-and-testedones? We argue t...
Reinforcement learning algorithms have provided useful insights into human and an- imal learning and...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Graduate School of Artificial Intelligence ArtificiMulti-armed bandit is a well-formulated test bed ...
This thesis consists of three studies investigating the strategy selection problem and the role of ...
The authors introduce the contextual multi-armed bandit task as a framework to investigate learning ...
In real-life decision environments people learn from their di-rect experience with alternative cours...
In repeated decision problems for which it is possible to learn from experience, people should activ...
The multi-armed bandit framework can be motivated by any problem where there is an abundance of choi...
Presented as part of the ARC11 lecture on October 30, 2017 at 10:00 a.m. in the Klaus Advanced Compu...
How do humans search for rewards? This question is commonly studied using multi-armed bandit tasks, ...
University of Technology Sydney. Faculty of Engineering and Information Technology.The sequential de...
How do people decide whether to try out novel options as opposed to tried-and-tested ones? We argue ...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
How do people decide whether to try out novel options as opposed to tried-and-testedones? We argue t...
Reinforcement learning algorithms have provided useful insights into human and an- imal learning and...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Graduate School of Artificial Intelligence ArtificiMulti-armed bandit is a well-formulated test bed ...
This thesis consists of three studies investigating the strategy selection problem and the role of ...