How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observations, is an important problem in cognitive science. We investigate this behavior in the context of a multi-armed bandit task. We com-pare human behavior to a variety of models that vary in their representational and computational complexity. Our result shows that subjects ’ choices, on a trial-to-trial basis, are best captured by a “forgetful ” Bayesian iterative learning model [21] in combination with a partially myopic decision policy known as Knowl-edge Gradient [7]. This model accounts for subjects ’ trial-by-trial choice better than a number of other previously proposed models, including optimal Bayesian learning and risk minimization,...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Reinforcement learning algorithms have provided useful insights into human and an- imal learning and...
Humans navigate daily decision-making by flexibly choosing appropriate approximations of what ought ...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
The bandit problem is a dynamic decision-making task that is simply described, well-suited to contro...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
We study learning in a bandit task in which the outcome probabilities of six arms switch (“jump”) ov...
Mathematical decision making theory has been successfully applied to the neuroscience of sensation, ...
Learning and decision making is one of the universal cornerstones of human and animal life. There ar...
Prescriptive Bayesian decision making has reached a high level of maturity and is well-supported alg...
Computational learning models are critical for understanding mechanisms of adaptive behavior. Howeve...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Reinforcement learning algorithms have provided useful insights into human and an- imal learning and...
Humans navigate daily decision-making by flexibly choosing appropriate approximations of what ought ...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
The bandit problem is a dynamic decision-making task that is simply described, well-suited to contro...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
We study learning in a bandit task in which the outcome probabilities of six arms switch (“jump”) ov...
Mathematical decision making theory has been successfully applied to the neuroscience of sensation, ...
Learning and decision making is one of the universal cornerstones of human and animal life. There ar...
Prescriptive Bayesian decision making has reached a high level of maturity and is well-supported alg...
Computational learning models are critical for understanding mechanisms of adaptive behavior. Howeve...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Reinforcement learning algorithms have provided useful insights into human and an- imal learning and...
Humans navigate daily decision-making by flexibly choosing appropriate approximations of what ought ...