†Joint first authors. Abstract — We introduce a dynamic neural algorithm called Dynamic Neural (DN) SARSA(λ) for learning a behavioral se-quence from delayed reward. DN-SARSA(λ) combines Dynamic Field Theory models of behavioral sequence representation, classical reinforcement learning, and a computational neuro-science model of working memory, called Item and Order work-ing memory, which serves as an eligibility trace. DN-SARSA(λ) is implemented on both a simulated and real robot that must learn a specific rewarding sequence of elementary behaviors from exploration. Results show DN-SARSA(λ) performs on the level of the discrete SARSA(λ), validating the feasibility of general reinforcement learning without compromising neural dynamics. I
We develop a novel, biologically detailed neural model of reinforcement learning (RL) processes in t...
When solving complex machine learning tasks, it is often more practical to let the agent find an ade...
AbstractDistal reward refers to a class of problems where reward is temporally distal from actions t...
Abstract. We present here a simulated model of a mobile Kuka Youbot which makes use of Dynamic Field...
A key problem in reinforcement learning is how an animal is able to learn a sequence of movements wh...
Storing and reproducing temporal intervals is an important component of perception, action generatio...
In this paper, the Reinforcement Learning problem is formulated equivalently to a Markov Decision Pr...
Neurally inspired robotics already has a long history that includes reactive systems emulating refle...
In the ¯eld of machine learning, reinforcement learning constitutes the idea of enabling machines to...
Abstract—A core requirement for autonomous robotic agents is that they be able to initiate actions t...
A neural model is described of how adaptively timed reinforcement learning occurs. The adaptive timi...
An animals’ ability to learn how to make decisions based on sensory evidence is often well described...
Neurally inspired robotics already has a long history that includes reactive systems emulating refle...
Abstract. Learning in the brain is associated with changes of connec-tion strengths between neurons....
Many of our everyday tasks require the control of the serial order and the timing of component actio...
We develop a novel, biologically detailed neural model of reinforcement learning (RL) processes in t...
When solving complex machine learning tasks, it is often more practical to let the agent find an ade...
AbstractDistal reward refers to a class of problems where reward is temporally distal from actions t...
Abstract. We present here a simulated model of a mobile Kuka Youbot which makes use of Dynamic Field...
A key problem in reinforcement learning is how an animal is able to learn a sequence of movements wh...
Storing and reproducing temporal intervals is an important component of perception, action generatio...
In this paper, the Reinforcement Learning problem is formulated equivalently to a Markov Decision Pr...
Neurally inspired robotics already has a long history that includes reactive systems emulating refle...
In the ¯eld of machine learning, reinforcement learning constitutes the idea of enabling machines to...
Abstract—A core requirement for autonomous robotic agents is that they be able to initiate actions t...
A neural model is described of how adaptively timed reinforcement learning occurs. The adaptive timi...
An animals’ ability to learn how to make decisions based on sensory evidence is often well described...
Neurally inspired robotics already has a long history that includes reactive systems emulating refle...
Abstract. Learning in the brain is associated with changes of connec-tion strengths between neurons....
Many of our everyday tasks require the control of the serial order and the timing of component actio...
We develop a novel, biologically detailed neural model of reinforcement learning (RL) processes in t...
When solving complex machine learning tasks, it is often more practical to let the agent find an ade...
AbstractDistal reward refers to a class of problems where reward is temporally distal from actions t...