Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting

Zhang, Shunan
Yu, Angela J

Publication date

January 2013

Publisher

Curran Associates, Inc.

Abstract

How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observations, is an important problem in cognitive science. We investigate this behavior in the context of a multi-armed bandit task. We compare human behavior to a variety of models that vary in their representational and computational complexity. Our result shows that subjects' choices, on a trial-to-trial basis, are best captured by a forgetful" Bayesian iterative learning model in combination with a partially myopic decision policy known as Knowledge Gradient. This model accounts for subjects' trial-by-trial choice better than a number of other previously proposed models, including optimal Bayesian learning and risk minimization, epsilon-greedy...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting

Abstract

Extracted data

Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting

Abstract

Extracted data

Related items

Related items