source code available at https://gitlab.inria.fr/robotlearn/xi_learningTransfer in Reinforcement Learning aims to improve learning performance on target tasks using knowledge from experienced source tasks. Successor Representations (SR) and their extension Successor Features (SF) are prominent transfer mechanisms in domains where reward functions change between tasks. They reevaluate the expected return of previously learned policies in a new target task to transfer their knowledge. The SF framework extended SR by linearly decomposing rewards into successor features and a reward weight vector allowing their application in high-dimensional tasks. But this came with the cost of having a linear relationship between reward functions and success...
International audienceA longstanding goal in reinforcement learning is to build intelligent agents t...
In this paper we study the question of life long learning of behaviors from human demonstrations by ...
In reinforcement learning, reward is used to guide the learning process. The reward is often designe...
source code available at https://gitlab.inria.fr/robotlearn/xi_learningTransfer in Reinforcement Lea...
Abstract: Successor Features stand at the boundary between modelfree and model-based Reinforcement L...
Transfer in Reinforcement Learning aims to improve learning performance on target tasks using knowle...
Successor-style representations have many advantages for reinforcement learning: for example, they c...
A common setting in multitask reinforcement learning (RL) demands that an agent rapidly adapt to var...
In this paper we introduce a simple approach for exploration in reinforcement learning (RL) that all...
Transfer in reinforcement learning refers to the notion that generalization should occur not only wi...
Theories of reward learning in neuroscience have focused on two families of algorithms, thought to c...
National audienceAccounting for behavioural capabilities and flexibilities experimentally observed i...
In reinforcement learning, temporal difference-based algorithms can be sample-inefficient: for insta...
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means o...
The goal of inverse reinforcement learning is to find a reward function for a Markov decision proces...
International audienceA longstanding goal in reinforcement learning is to build intelligent agents t...
In this paper we study the question of life long learning of behaviors from human demonstrations by ...
In reinforcement learning, reward is used to guide the learning process. The reward is often designe...
source code available at https://gitlab.inria.fr/robotlearn/xi_learningTransfer in Reinforcement Lea...
Abstract: Successor Features stand at the boundary between modelfree and model-based Reinforcement L...
Transfer in Reinforcement Learning aims to improve learning performance on target tasks using knowle...
Successor-style representations have many advantages for reinforcement learning: for example, they c...
A common setting in multitask reinforcement learning (RL) demands that an agent rapidly adapt to var...
In this paper we introduce a simple approach for exploration in reinforcement learning (RL) that all...
Transfer in reinforcement learning refers to the notion that generalization should occur not only wi...
Theories of reward learning in neuroscience have focused on two families of algorithms, thought to c...
National audienceAccounting for behavioural capabilities and flexibilities experimentally observed i...
In reinforcement learning, temporal difference-based algorithms can be sample-inefficient: for insta...
The successor representation was introduced into reinforcement learning by Dayan (1993) as a means o...
The goal of inverse reinforcement learning is to find a reward function for a Markov decision proces...
International audienceA longstanding goal in reinforcement learning is to build intelligent agents t...
In this paper we study the question of life long learning of behaviors from human demonstrations by ...
In reinforcement learning, reward is used to guide the learning process. The reward is often designe...