International audienceA longstanding goal in reinforcement learning is to build intelligent agents that show fast learning and a flexible transfer of skills akin to humans and animals. This paper investigates the integration of two frameworks for tackling those goals: episodic control and successor features. Episodic control is a cognitively inspired approach relying on episodic memory, an instance-based memory model of an agent's experiences. Meanwhile, successor features and generalized policy improvement (SF&GPI) is a meta and transfer learning framework allowing to learn policies for tasks that can be efficiently reused for later tasks which have a different reward function. Individually, these two techniques have shown impressive resul...
The process for transferring knowledge of multiple reinforcement learning policies into a single mul...
For decades, neuroscientists and psychologists have observed that animal performance on spatial navi...
Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inef...
International audienceA longstanding goal in reinforcement learning is to build intelligent agents t...
Episodic control enables sample efficiency in reinforcement learning by recalling past experiences f...
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means o...
Treball fi de màster de: Master in Cognitive Systems and Interactive MediaDirectors: Ismael T. Freir...
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of artificial int...
Abstract: Successor Features stand at the boundary between modelfree and model-based Reinforcement L...
International audienceAugmenting the representation of the current state of the external world with ...
Memory lies at the heart of human cognitive abilities. Therefore, understanding it from neural, psyc...
Reinforcement learning systems usually assume that a value function is defined over all states (or s...
Elements from cognitive psychology have been applied in a variety of ways to artificial intelligence...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Deep reinforcement learning has shown great potential in training dialogue policies. However, its fa...
The process for transferring knowledge of multiple reinforcement learning policies into a single mul...
For decades, neuroscientists and psychologists have observed that animal performance on spatial navi...
Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inef...
International audienceA longstanding goal in reinforcement learning is to build intelligent agents t...
Episodic control enables sample efficiency in reinforcement learning by recalling past experiences f...
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means o...
Treball fi de màster de: Master in Cognitive Systems and Interactive MediaDirectors: Ismael T. Freir...
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of artificial int...
Abstract: Successor Features stand at the boundary between modelfree and model-based Reinforcement L...
International audienceAugmenting the representation of the current state of the external world with ...
Memory lies at the heart of human cognitive abilities. Therefore, understanding it from neural, psyc...
Reinforcement learning systems usually assume that a value function is defined over all states (or s...
Elements from cognitive psychology have been applied in a variety of ways to artificial intelligence...
Online reinforcement learning agents are currently able to process an increasing amount of data by c...
Deep reinforcement learning has shown great potential in training dialogue policies. However, its fa...
The process for transferring knowledge of multiple reinforcement learning policies into a single mul...
For decades, neuroscientists and psychologists have observed that animal performance on spatial navi...
Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inef...