Prediction-error signals consistent with formal models of “reinforcement learning” (RL) have repeatedly been found within dopaminergic nuclei of the midbrain and dopaminoceptive areas of the striatum. However, the precise form of the RL algorithms implemented in the human brain is not yet well determined. Here, we created a novel paradigm optimized to dissociate the subtypes of reward-prediction errors that function as the key computational signatures of two distinct classes of RL models—namely, “actor/critic” models and action-value-learning models (e.g., the Q-learning model). The state-value-prediction error (SVPE), which is independent of actions, is a hallmark of the actor/critic architecture, whereas the action-value-prediction error ...
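To make the distinction concrete, here is a minimal sketch of the two error signals in their textbook temporal-difference forms. It is an illustration only, not necessarily the exact models fitted in the study: the function names, the discount factor `gamma`, and the toy values in `V` and `Q` are all assumptions introduced here.

```python
# Textbook forms of the two prediction errors (illustrative assumptions,
# not the authors' fitted models).

def state_value_prediction_error(V, state, reward, next_state, gamma=0.9):
    """SVPE: compares outcomes with the value of the state; the action taken plays no role."""
    return reward + gamma * V[next_state] - V[state]


def action_value_prediction_error(Q, state, action, reward, next_state, gamma=0.9):
    """AVPE (Q-learning form): compares outcomes with the value of the chosen action."""
    next_values = Q[next_state].values()
    # A terminal state has no available actions, so its future value is 0.
    best_next = max(next_values) if next_values else 0.0
    return reward + gamma * best_next - Q[state][action]


# Hypothetical toy values: one decision state "s0" and a terminal state "s1".
V = {"s0": 0.5, "s1": 0.0}
Q = {"s0": {"left": 0.2, "right": 0.8}, "s1": {}}

# Same state, same reward, different actions.
svpe = state_value_prediction_error(V, "s0", 1.0, "s1")
avpe_left = action_value_prediction_error(Q, "s0", "left", 1.0, "s1")
avpe_right = action_value_prediction_error(Q, "s0", "right", 1.0, "s1")
```

In this toy case the SVPE is the same whichever action was chosen, whereas the AVPE is larger after the low-valued action than after the high-valued one. A dissociation of this kind is what lets a task separate the two signals.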
Many previous studies of the brain areas involved in reward prediction errors have not accounted for...
The computational framework of reinforcement learning has been used to forward our understanding of ...
Learning how to reach a reward over long series of actions is a remarkable capability of humans, and...
Reinforcement learning (RL) uses sequential experience with situations (“states”) and outcomes to as...
Reinforcement learning (RL) provides a framework involving two diverse approaches to reward-based de...
Reinforcement learning describes motivated behavior in terms of two abstract signals. The representa...
Reinforcement learning (RL) in simple instrumental tasks is usually modeled as a monolithic process ...
To repeat actions that lead to good outcomes and to avoid repeating actions that lead to undesirable...
Learning occurs when an outcome deviates from expectation (prediction error). According to formal le...
Accumulating evidence from nonhuman primates suggests that midbrain dopamine cells code rewa...
Goal-directed and instrumental learning are both important controllers of human behavior. Learning a...