Humans often face sequential decision-making problems, in which information about the environmental reward structure is detached from rewards for a subset of actions. In the current exploratory study, we introduce an information-selective symmetric reversal bandit task to model such situations and obtained choice data on this task from 24 participants. To arbitrate between different decision-making strategies that participants may use on this task, we developed a set of probabilistic agent-based behavioral models, including exploitative and explorative Bayesian agents, as well as heuristic control agents. Upon validating the model and parameter recovery properties of our model set and summarizing the participants' choice data in a descripti...
tatsujit[at]mail.dendai.ac.jp In an uncertain environment, decision-making meets two opposing demand...
Computational models of learning have proved largely successful in characterizing potential mechanis...
How do people solve the explore-exploit trade-off in a changing environment? In this paper we presen...
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
The bandit problem is a dynamic decision-making task that is simply described, well-suited to contro...
We consider a class of bandit problems in which a decision-maker must choose between a set of altern...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
Research in cognitive psychology regarding sequential decision-making usually involves tasks where a...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Thesis (Ph.D.)--University of Washington, 2021Existing computational models of decision making are o...
All adaptive organisms face the fundamental tradeoff between pursuing a known reward (exploitation) ...
Humans frequently overestimate the likelihood of desirable events while underestimating the likeliho...
The tradeoff between pursuing a known reward (exploitation) and sampling unknown, potentially better...
tatsujit[at]mail.dendai.ac.jp In an uncertain environment, decision-making meets two opposing demand...
Computational models of learning have proved largely successful in characterizing potential mechanis...
How do people solve the explore-exploit trade-off in a changing environment? In this paper we presen...
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
The bandit problem is a dynamic decision-making task that is simply described, well-suited to contro...
We consider a class of bandit problems in which a decision-maker must choose between a set of altern...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
We study bandit problems in which a decision-maker gets reward-or-failure feedback when choosing rep...
Research in cognitive psychology regarding sequential decision-making usually involves tasks where a...
We study human learning & decision-making in tasks with probabilistic rewards. Recent studies in...
Thesis (Ph.D.)--University of Washington, 2021Existing computational models of decision making are o...
All adaptive organisms face the fundamental tradeoff between pursuing a known reward (exploitation) ...
Humans frequently overestimate the likelihood of desirable events while underestimating the likeliho...
The tradeoff between pursuing a known reward (exploitation) and sampling unknown, potentially better...
tatsujit[at]mail.dendai.ac.jp In an uncertain environment, decision-making meets two opposing demand...
Computational models of learning have proved largely successful in characterizing potential mechanis...
How do people solve the explore-exploit trade-off in a changing environment? In this paper we presen...