Despite the recent success of reinforcement learning in various domains, these approaches remain, for the most part, deterringly sensitive to hyper-parameters and are often riddled with essential engineering feats allowing their success. We consider the case of off-policy generative adversarial imitation learning, and perform an in-depth review, qualitative and quantitative, of the method. We show that forcing the learned reward function to be local Lipschitz-continuous is a sine qua non condition for the method to perform well. We then study the effects of this necessary condition and provide several theoretical results involving the local Lipschitzness of the state-value function. We complement these guarantees with empirical evidence att...
Imitation Learning (IL) is an important paradigm within the broader reinforcement learning (RL) meth...
RJCIA 2022National audienceDeep Reinforcement Learning methods require a large amount of data to ach...
We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only...
Despite the recent success of reinforcement learning in various domains, these approaches remain, fo...
Despite the recent success of reinforcement learning in various domains, these approaches remain, fo...
Modelling a reward able to convey the right incentive to the agent is fairly tedious in terms of eng...
Imitation learning refers to a family of learning algorithms enabling the learning agents to learn d...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
This paper proposes a new regularization technique for reinforcement learning (RL) towards making po...
We consider the problem of imitation learning from a finite set of expert trajectories, without acce...
Imitation learning refers to the problem where an agent learns a policy that mimics the demonstratio...
The introduction of the generative adversarial imitation learning (GAIL) algorithm has spurred the d...
A common problem in Reinforcement Learning (RL) is that the reward function is hard to express. This...
Imitation learning (IL) has recently shown impressive performance in training a reinforcement learni...
Imitation learning is the task of replicating expert policy from demonstrations, without access to a...
Imitation Learning (IL) is an important paradigm within the broader reinforcement learning (RL) meth...
RJCIA 2022National audienceDeep Reinforcement Learning methods require a large amount of data to ach...
We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only...
Despite the recent success of reinforcement learning in various domains, these approaches remain, fo...
Despite the recent success of reinforcement learning in various domains, these approaches remain, fo...
Modelling a reward able to convey the right incentive to the agent is fairly tedious in terms of eng...
Imitation learning refers to a family of learning algorithms enabling the learning agents to learn d...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
This paper proposes a new regularization technique for reinforcement learning (RL) towards making po...
We consider the problem of imitation learning from a finite set of expert trajectories, without acce...
Imitation learning refers to the problem where an agent learns a policy that mimics the demonstratio...
The introduction of the generative adversarial imitation learning (GAIL) algorithm has spurred the d...
A common problem in Reinforcement Learning (RL) is that the reward function is hard to express. This...
Imitation learning (IL) has recently shown impressive performance in training a reinforcement learni...
Imitation learning is the task of replicating expert policy from demonstrations, without access to a...
Imitation Learning (IL) is an important paradigm within the broader reinforcement learning (RL) meth...
RJCIA 2022National audienceDeep Reinforcement Learning methods require a large amount of data to ach...
We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only...