How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observations, is an important problem in cognitive science. We investigate this behavior in the context of a multi-armed bandit task. We compare human behavior to a variety of models that vary in their representational and computational complexity. Our result shows that subjects' choices, on a trial-to-trial basis, are best captured by a forgetful" Bayesian iterative learning model in combination with a partially myopic decision policy known as Knowledge Gradient. This model accounts for subjects' trial-by-trial choice better than a number of other previously proposed models, including optimal Bayesian learning and risk minimization, epsilon-greedy...
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
Prescriptive Bayesian decision making has reached a high level of maturity and is well-supported alg...
Computational learning models are critical for understanding mechanisms of adaptive behavior. Howeve...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
The bandit problem is a dynamic decision-making task that is simply described, well-suited to contro...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
We study learning in a bandit task in which the outcome probabilities of six arms switch (“jump”) ov...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
Mathematical decision making theory has been successfully applied to the neuroscience of sensation, ...
Learning and decision making is one of the universal cornerstones of human and animal life. There ar...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
In repeated decision problems for which it is possible to learn from experience, people should activ...
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
Prescriptive Bayesian decision making has reached a high level of maturity and is well-supported alg...
Computational learning models are critical for understanding mechanisms of adaptive behavior. Howeve...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
The bandit problem is a dynamic decision-making task that is simply described, well-suited to contro...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
We study learning in a bandit task in which the outcome probabilities of six arms switch (“jump”) ov...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
Mathematical decision making theory has been successfully applied to the neuroscience of sensation, ...
Learning and decision making is one of the universal cornerstones of human and animal life. There ar...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
In repeated decision problems for which it is possible to learn from experience, people should activ...
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
Prescriptive Bayesian decision making has reached a high level of maturity and is well-supported alg...
Computational learning models are critical for understanding mechanisms of adaptive behavior. Howeve...