Theories of reward learning in neuroscience have focused on two families of algorithms, thought to capture deliberative vs. habitual choice. Model-based algorithms compute the value of candidate actions from scratch, whereas model-free algorithms make choice more efficient but less flexible by storing pre-computed action values. We examine an intermediate algorithmic family, the successor representation (SR), which balances flexibility and efficiency by storing partially computed action values: predictions about future events. These pre-computation strategies differ in how they update their choices following changes in a task. SR's reliance on stored predictions about future states predicts a unique signature of insensitivity to changes in ...
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means o...
Flexible action selection requires knowledge about how alternative actions impact the environment: a...
How inputs are represented is critical for performance in decision-making problems since it determin...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
National audienceAccounting for behavioural capabilities and flexibilities experimentally observed i...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
<p>To behave adaptively, we must learn from the consequences of our actions. Doing so is difficult w...
How do we use our memories of the past to guide decisions we’ve never had to make before? Although e...
Reinforcement learning systems usually assume that a value function is defined over all states (or s...
The ability to predict long-term future rewards is crucial for survival. Some animals may have to ...
Reinforcement learning constitutes a valuable framework for reward-based decision making in humans, ...
The authors propose a reinforcement-learning mechanism as a model for recurrent choice and extend it...
Recent research has shown that perceptual processing of stimuli previously associated with high-valu...
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means o...
Flexible action selection requires knowledge about how alternative actions impact the environment: a...
How inputs are represented is critical for performance in decision-making problems since it determin...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
National audienceAccounting for behavioural capabilities and flexibilities experimentally observed i...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
<p>To behave adaptively, we must learn from the consequences of our actions. Doing so is difficult w...
How do we use our memories of the past to guide decisions we’ve never had to make before? Although e...
Reinforcement learning systems usually assume that a value function is defined over all states (or s...
The ability to predict long-term future rewards is crucial for survival. Some animals may have to ...
Reinforcement learning constitutes a valuable framework for reward-based decision making in humans, ...
The authors propose a reinforcement-learning mechanism as a model for recurrent choice and extend it...
Recent research has shown that perceptual processing of stimuli previously associated with high-valu...
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means o...
Flexible action selection requires knowledge about how alternative actions impact the environment: a...
How inputs are represented is critical for performance in decision-making problems since it determin...