The discovery of reusable sub-routines simplifies decision-making and planning in complex reinforcement learning problems. Previous approaches propose to learn such temporal abstractions in a purely unsupervised fashion through observing state-action trajectories gathered from executing a policy. However, a current limitation is that they process each trajectory in an entirely sequential manner, which prevents them from revising earlier decisions about sub-routine boundary points in light of new incoming information. In this work we propose SloTTAr, a fully parallel approach that integrates sequence processing Transformers with a Slot Attention module and adaptive computation for learning about the number of such sub-routines in an unsuperv...
†Joint first authors. Abstract — We introduce a dynamic neural algorithm called Dynamic Neural (DN) ...
Abstract. Reinforcement learning is bedeviled by the curse of dimensionality: the number of paramete...
The convergence property of reinforcement learning has been extensively investigated in the field of...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Biological agents do not have infinite resources to learn new things. For this reason, a central asp...
Rewards play an essential role in reinforcement learning. In contrast to rule-based game environment...
The transformer architecture and variants presented remarkable success across many machine learning ...
Abstraction of complex, longer motor tasks into simpler elemental movements enables humans and anima...
Abstraction of complex, longer motor tasks into simpler elemental movements enables humans and anima...
We show that the presented architecture allows for unsupervised learning; that synaptic rewiring enh...
Transformer architectures are now central to sequence modeling tasks. At its heart is the attention ...
This paper describes an unsupervised neural network model for learning and recall of temporal patter...
Humans can learn several tasks in succession with minimal mutual interference but perform more poorl...
This article discusses the unsupervised learning of a network for a temporally precise sequence. A n...
Item does not contain fulltextPeople learn and use complex sequential actions on a daily basis, desp...
†Joint first authors. Abstract — We introduce a dynamic neural algorithm called Dynamic Neural (DN) ...
Abstract. Reinforcement learning is bedeviled by the curse of dimensionality: the number of paramete...
The convergence property of reinforcement learning has been extensively investigated in the field of...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Biological agents do not have infinite resources to learn new things. For this reason, a central asp...
Rewards play an essential role in reinforcement learning. In contrast to rule-based game environment...
The transformer architecture and variants presented remarkable success across many machine learning ...
Abstraction of complex, longer motor tasks into simpler elemental movements enables humans and anima...
Abstraction of complex, longer motor tasks into simpler elemental movements enables humans and anima...
We show that the presented architecture allows for unsupervised learning; that synaptic rewiring enh...
Transformer architectures are now central to sequence modeling tasks. At its heart is the attention ...
This paper describes an unsupervised neural network model for learning and recall of temporal patter...
Humans can learn several tasks in succession with minimal mutual interference but perform more poorl...
This article discusses the unsupervised learning of a network for a temporally precise sequence. A n...
Item does not contain fulltextPeople learn and use complex sequential actions on a daily basis, desp...
†Joint first authors. Abstract — We introduce a dynamic neural algorithm called Dynamic Neural (DN) ...
Abstract. Reinforcement learning is bedeviled by the curse of dimensionality: the number of paramete...
The convergence property of reinforcement learning has been extensively investigated in the field of...