Learning theorists posit two reinforcement learning systems: model-free and model-based. Model-based learning incorporates knowledge about structure and contingencies in the world to assign candidate actions with an expected value. Model-free learning is ignorant of the world’s structure; instead, actions hold a value based on prior reinforcement, with this value updated by expectancy violation in the form of a reward prediction error. Because they use such different learning mechanisms, it has been previously assumed that model-based and model-free learning are computationally dissociated in the brain. However, recent fMRI evidence suggests that the brain may compute reward prediction errors to both model-free and model-based estimates of ...
A major open question is whether computational strategies thought to be used during experiential lea...
Adaptive decision making depends on the accurate representation of rewards associated with potential...
Learning how to reach a reward over long series of actions is a remarkable capability of humans, and...
SummaryReinforcement learning (RL) uses sequential experience with situations (“states”) and outcome...
Reinforcement learning (RL) provides a framework involving two diverse approaches to reward-based de...
SummaryReinforcement learning (RL) uses sequential experience with situations (“states”) and outcome...
A standard assumption in neuroscience is that low-effort model-free learning is automatic and contin...
Reinforcement learning (RL) uses sequential experience with situations (“states”) and outcomes to as...
Reinforcement learning (RL) uses sequential experience with situations (“states”) and outcomes to as...
Learning occurs when an outcome deviates from expectation (prediction error). According to formal le...
Learning occurs when an outcome deviates from expectation (prediction error). According to formal le...
Learning occurs when an outcome deviates from expectation (prediction error). According to formal le...
Prediction-error signals consistent with formal models of "reinforcement learning" (RL) have repeate...
Reinforcement learning describes motivated behavior in terms of two abstract signals. The representa...
A major open question is whether computational strategies thought to be used during experiential lea...
A major open question is whether computational strategies thought to be used during experiential lea...
Adaptive decision making depends on the accurate representation of rewards associated with potential...
Learning how to reach a reward over long series of actions is a remarkable capability of humans, and...
SummaryReinforcement learning (RL) uses sequential experience with situations (“states”) and outcome...
Reinforcement learning (RL) provides a framework involving two diverse approaches to reward-based de...
SummaryReinforcement learning (RL) uses sequential experience with situations (“states”) and outcome...
A standard assumption in neuroscience is that low-effort model-free learning is automatic and contin...
Reinforcement learning (RL) uses sequential experience with situations (“states”) and outcomes to as...
Reinforcement learning (RL) uses sequential experience with situations (“states”) and outcomes to as...
Learning occurs when an outcome deviates from expectation (prediction error). According to formal le...
Learning occurs when an outcome deviates from expectation (prediction error). According to formal le...
Learning occurs when an outcome deviates from expectation (prediction error). According to formal le...
Prediction-error signals consistent with formal models of "reinforcement learning" (RL) have repeate...
Reinforcement learning describes motivated behavior in terms of two abstract signals. The representa...
A major open question is whether computational strategies thought to be used during experiential lea...
A major open question is whether computational strategies thought to be used during experiential lea...
Adaptive decision making depends on the accurate representation of rewards associated with potential...
Learning how to reach a reward over long series of actions is a remarkable capability of humans, and...