How do people decide whether to try out novel options as opposed to tried-and-tested ones? We argue that they infer a novel option's reward from contextual information learned from functional relations and take uncertainty into account when making a decision. We propose a Bayesian optimization model to describe their learning and decision making. This model relies on similarity-based learning of functional relationships between features and rewards, and a choice rule that balances exploration and exploitation by combining predicted rewards and the uncertainty of these predictions. Our model makes two main predictions. First, decision makers who learn functional relationships will generalize based on the learned reward function, choosing nov...
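The abstract describes the model only in words, so the following is a minimal sketch of one way the two ingredients could fit together, not the authors' implementation. It assumes Gaussian process regression with a radial basis function kernel as the similarity-based learner, a zero-mean prior with unit variance, and an upper-confidence-bound rule (predicted reward plus `beta` times predictive standard deviation) as the choice rule. All function names, parameters, and default values here are hypothetical.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0):
    """Similarity between feature vectors; a and b have shape (n, d) and (m, d)."""
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-0.5 * sq_dist / length_scale ** 2)


def gp_posterior(x_seen, y_seen, x_query, length_scale=1.0, noise_var=0.1):
    """Predicted reward and its uncertainty for each query option.

    Assumes a zero-mean prior with unit prior variance (an illustrative choice).
    """
    k_ss = rbf_kernel(x_seen, x_seen, length_scale) + noise_var * np.eye(len(x_seen))
    k_qs = rbf_kernel(x_query, x_seen, length_scale)
    mean = k_qs @ np.linalg.solve(k_ss, y_seen)
    # Prior variance (1.0) minus the variance explained by the observations.
    var = 1.0 - np.einsum("ij,ji->i", k_qs, np.linalg.solve(k_ss, k_qs.T))
    return mean, np.sqrt(np.clip(var, 0.0, None))


def ucb_choice(mean, sd, beta=1.0):
    """Pick the option with the highest predicted reward plus uncertainty bonus."""
    return int(np.argmax(mean + beta * sd))
```

With `beta = 0` the rule ignores uncertainty and simply picks the option with the highest predicted reward. Larger values of `beta` increasingly favor options whose features are far from anything observed so far, which is where a novel option would typically fall.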
Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, ...
Summary: How do individuals decide to act based on a rewarding status quo versus an unexplored choice ...
Little is known about how humans solve the exploitation/exploration trade-off. In particular, the ev...
Reinforcement learning algorithms have provided useful insights into human and animal learning and...
In repeated decision problems for which it is possible to learn from experience, people should activ...
In reinforcement learning (RL), a decision maker searching for the most rewarding option is often fa...
To flexibly adapt to the demands of their environment, animals are constantly exposed to the conflic...
The exploitation-exploration (EE) trade-off describes how, when making a decision, an organism must ...
Successful behaviour depends on the right balance between maximising reward and soliciting informati...
Many types of intelligent behavior can be framed as a search problem, where an individual must explo...
The authors introduce the contextual multi-armed bandit task as a framework to investigate learning ...
The tradeoff between pursuing a known reward (exploitation) and sampling unknown, potentially better...
In this paper we computationally examine how subjective experience may help or harm the decision mak...