Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inefficiency of standard deep reinforcement learning approaches. Using non-/semi-parametric models to estimate the value function, they learn rapidly, retrieving cached values from similar past states. In realistic scenarios, with limited resources and noisy data, maintaining meaningful representations in memory is essential to speed up the learning and avoid catastrophic forgetting. Unfortunately, EC methods have a large space and time complexity. We investigate different solutions to these problems based on prioritising and ranking stored states, as well as online clustering techniques. We also propose a new dynamic online k-means algorithm tha...
This paper studies the use of Reinforcement Learning (RL) policies for optimizing the sequencing of...
Treballs Finals de Grau de Física, Facultat de Física, Universitat de Barcelona, Any: 2015, Tutor: P...
The study of a semi-supervised clustering has recently attracted great interest from the data cluste...
Non-parametric episodic memory can be used to quickly latch onto high-reward experience in reinforce...
Achieving sample efficiency in online episodic reinforcement learning (RL) requires optimally balanc...
Episodic control enables sample efficiency in reinforcement learning by recalling past experiences f...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
International audienceA longstanding goal in reinforcement learning is to build intelligent agents t...
2017 Summer.Includes bibliographical references.This dissertation presents knowledge acquisition and...
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of artificial int...
Deep learning has revolutionised artificial intelligence, where the application of increased compute...
Recent advances in reinforcement-learning research have demonstrated impressive results in building ...
This paper explains an episodic-memory based approach for computing anticipatory robot behavior in ...
We describe a density-adaptive reinforcement learning and a density-adaptive forgetting algorithm. ...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
This paper studies the use of Reinforcement Learning (RL) policies for optimizing the sequencing of...
Treballs Finals de Grau de Física, Facultat de Física, Universitat de Barcelona, Any: 2015, Tutor: P...
The study of a semi-supervised clustering has recently attracted great interest from the data cluste...
Non-parametric episodic memory can be used to quickly latch onto high-reward experience in reinforce...
Achieving sample efficiency in online episodic reinforcement learning (RL) requires optimally balanc...
Episodic control enables sample efficiency in reinforcement learning by recalling past experiences f...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
International audienceA longstanding goal in reinforcement learning is to build intelligent agents t...
2017 Summer.Includes bibliographical references.This dissertation presents knowledge acquisition and...
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of artificial int...
Deep learning has revolutionised artificial intelligence, where the application of increased compute...
Recent advances in reinforcement-learning research have demonstrated impressive results in building ...
This paper explains an episodic-memory based approach for computing anticipatory robot behavior in ...
We describe a density-adaptive reinforcement learning and a density-adaptive forgetting algorithm. ...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
This paper studies the use of Reinforcement Learning (RL) policies for optimizing the sequencing of...
Treballs Finals de Grau de Física, Facultat de Física, Universitat de Barcelona, Any: 2015, Tutor: P...
The study of a semi-supervised clustering has recently attracted great interest from the data cluste...