In many daily tasks, we make multiple decisions before reaching a goal. In order to learn such sequences of decisions, a mechanism to link earlier actions to later reward is necessary. Reinforcement learning (RL) theory suggests two classes of algorithms solving this credit assignment problem: In classic temporal-difference learning, earlier actions receive reward information only after multiple repetitions of the task, whereas models with eligibility traces reinforce entire sequences of actions from a single experience (one-shot). Here, we show one-shot learning of sequences. We developed a novel paradigm to directly observe which actions and states along a multi-step sequence are reinforced after a single reward. By focusing our analysis ...
Everyday tasks seldom involve isolate actions but sequences of them. We can see whether previous act...
Everyday tasks seldom involve isolate actions but sequences of them. We can see whether previous act...
We present a model for plasticity induction in reinforcement learning which is based on a cascade of...
In many daily tasks we make multiple decisions before reaching a goal. In order to learn such sequen...
Whether we prepare a coffee or navigate to a shop: in many tasks we make multiple decisions before r...
Human adaptive decision-making recruits multiple cognitive processes for learning stimulus-action (S...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
In reinforcement learning (RL), an agent makes sequential decisions to maximise the reward it can ob...
Learning sequential actions is an essential ability, for most daily activities are sequential. We mo...
The question of how to determine which states and actions are responsible for a certain outcome is k...
<p>To behave adaptively, we must learn from the consequences of our actions. Doing so is difficult w...
Theories of reward learning in neuroscience have focused on two families of algorithms, thought to c...
Reinforcement learning systems usually assume that a value function is defined over all states (or s...
The authors propose a reinforcement-learning mechanism as a model for recurrent choice and extend it...
Everyday tasks seldom involve isolate actions but sequences of them. We can see whether previous act...
Everyday tasks seldom involve isolate actions but sequences of them. We can see whether previous act...
We present a model for plasticity induction in reinforcement learning which is based on a cascade of...
In many daily tasks we make multiple decisions before reaching a goal. In order to learn such sequen...
Whether we prepare a coffee or navigate to a shop: in many tasks we make multiple decisions before r...
Human adaptive decision-making recruits multiple cognitive processes for learning stimulus-action (S...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
In reinforcement learning (RL), an agent makes sequential decisions to maximise the reward it can ob...
Learning sequential actions is an essential ability, for most daily activities are sequential. We mo...
The question of how to determine which states and actions are responsible for a certain outcome is k...
<p>To behave adaptively, we must learn from the consequences of our actions. Doing so is difficult w...
Theories of reward learning in neuroscience have focused on two families of algorithms, thought to c...
Reinforcement learning systems usually assume that a value function is defined over all states (or s...
The authors propose a reinforcement-learning mechanism as a model for recurrent choice and extend it...
Everyday tasks seldom involve isolate actions but sequences of them. We can see whether previous act...
Everyday tasks seldom involve isolate actions but sequences of them. We can see whether previous act...
We present a model for plasticity induction in reinforcement learning which is based on a cascade of...