Learning to predict future outcomes is critical for driving appropriate behaviors. Reinforcement learning (RL) models have successfully accounted for such learning, relying on reward prediction errors (RPEs) signaled by midbrain dopamine neurons. It has been proposed that when sensory data provide only ambiguous information about which state an animal is in, it can predict reward based on a set of probabilities assigned to hypothetical states (called the belief state). Here we examine how dopamine RPEs and subsequent learning are regulated under state uncertainty. Mice are first trained in a task with two potential states defined by different reward amounts. During testing, intermediate-sized rewards are given in rare trials. Dopamine activ...
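The belief-state idea above can be illustrated with a minimal sketch. It is not the model used in the study: the reward sizes, the equal prior, the Gaussian reward-noise likelihood, and the function names (`expected_value`, `rpe`, `update_belief`) are all assumptions chosen for illustration. The sketch predicts reward as a belief-weighted sum over two hypothetical states, computes the RPE for an intermediate-sized reward, and then updates the belief with Bayes' rule.

```python
import math

def expected_value(belief, values):
    """Belief-weighted reward prediction: sum over states of b(s) * V(s)."""
    return sum(b * v for b, v in zip(belief, values))

def rpe(reward, belief, values):
    """Reward prediction error: actual reward minus belief-weighted prediction."""
    return reward - expected_value(belief, values)

def update_belief(prior, reward, state_means, noise_sd):
    """Posterior over states after observing a reward.

    Assumes (for illustration only) that the perceived reward size is the
    state's mean reward plus Gaussian noise with standard deviation noise_sd.
    """
    likelihoods = [
        math.exp(-0.5 * ((reward - m) / noise_sd) ** 2) for m in state_means
    ]
    unnormalized = [p * l for p, l in zip(prior, likelihoods)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]

# Hypothetical numbers: a "small" state (reward 1) and a "big" state (reward 10).
values = [1.0, 10.0]
prior = [0.5, 0.5]

# An intermediate reward of 4 is compared against the belief-weighted prediction.
error = rpe(4.0, prior, values)

# The same observation shifts the belief toward the state whose reward is closer.
posterior = update_belief(prior, 4.0, values, noise_sd=3.0)
```

With these assumed numbers the prediction is 5.5, so the intermediate reward yields a negative RPE of -1.5, while the posterior shifts toward the small-reward state.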
Substantial evidence suggests that the phasic activities of dopaminergic neurons in the primate midb...
Making predictions about the rewards associated with environmental stimuli and updating those predic...
Deciding between stimuli requires combining their learned value with one's sensory confidence. We tr...
Central to the organization of behavior is the ability to predict the values of outcomes to guide ch...
Predicting outcomes is a critical ability of humans and animals. The dopamine reward prediction erro...
To make optimal decisions and get the best outcomes, human and animal decision-makers must traverse ...
Midbrain dopamine neurons are commonly thought to report a reward prediction error (RPE), as hypothe...
Midbrain dopamine neurons signal reward prediction error (RPE), or actual minus expected reward. The...
The basal ganglia pathways and their dopaminergic innervation are essential for reinforcement learni...
To accurately predict rewards associated with states or actions, the variability of observations has...
Experiments have implicated dopamine in model-based reinforcement learning (RL). These findings are ...
Temporal difference learning models propose phasic dopamine signaling encodes reward prediction erro...