Abstract. Imitation learning is an effective strategy to reinforcement learning, which avoids the delayed reward problem by learning from mentor-demonstrated trajectories. A limitation for imitation learning is that collecting sufficient qualified demonstrations is quite expensive. In this work, we study how an agent can automatically improve its perfor-mance from a weak policy, by automatically acquiring more demonstra-tions for learning. We propose the LEWE framework to sample tasks for the weak policy to execute, and then learn from the successful trajecto-ries to achieve an improvement. As the sampling strategy is the key to the efficiency of LEWE, we further propose to incorporate active learning for the sampling strategy for LEWE. Exp...
Sample efficiency is crucial for imitation learning methods to be applicable in real-world applicati...
Sample efficiency is crucial for imitation learning methods to be applicable in real-world applicati...
RJCIA 2022National audienceDeep Reinforcement Learning methods require a large amount of data to ach...
International audienceSelf-imitation learning is a Reinforcement Learning (RL) method that encourage...
Abstract. Reinforcement learning techniques are increasingly being used to solve dicult problems in ...
Adversarial imitation learning has become a widely used imitation learning framework. The discrimina...
Reinforcement learning (RL) has demonstrated its superiority in solving sequential decision-making p...
Reinforcement Learning has achieved noticeable success in many fields, such as video game playing, c...
Imitation learning (IL) enables robots to acquire skills quickly by transferring expert knowledge, w...
International audience—Learning from Demonstrations (LfD) is a paradigm by which an apprentice agent...
Heuristic-based active learning (AL) methods are limited when the data distribution of the underlyin...
The promise of imitation is to facilitate learning by allowing the learner to ob-serve a teacher in ...
Advances in robotics have resulted in increases both in the availability of robots and also their co...
Imitation Learning from observation describes policy learning in a similar way to human learning. An...
Many existing imitation learning datasets are collected from multiple demonstrators, each with diffe...
Sample efficiency is crucial for imitation learning methods to be applicable in real-world applicati...
Sample efficiency is crucial for imitation learning methods to be applicable in real-world applicati...
RJCIA 2022National audienceDeep Reinforcement Learning methods require a large amount of data to ach...
International audienceSelf-imitation learning is a Reinforcement Learning (RL) method that encourage...
Abstract. Reinforcement learning techniques are increasingly being used to solve dicult problems in ...
Adversarial imitation learning has become a widely used imitation learning framework. The discrimina...
Reinforcement learning (RL) has demonstrated its superiority in solving sequential decision-making p...
Reinforcement Learning has achieved noticeable success in many fields, such as video game playing, c...
Imitation learning (IL) enables robots to acquire skills quickly by transferring expert knowledge, w...
International audience—Learning from Demonstrations (LfD) is a paradigm by which an apprentice agent...
Heuristic-based active learning (AL) methods are limited when the data distribution of the underlyin...
The promise of imitation is to facilitate learning by allowing the learner to ob-serve a teacher in ...
Advances in robotics have resulted in increases both in the availability of robots and also their co...
Imitation Learning from observation describes policy learning in a similar way to human learning. An...
Many existing imitation learning datasets are collected from multiple demonstrators, each with diffe...
Sample efficiency is crucial for imitation learning methods to be applicable in real-world applicati...
Sample efficiency is crucial for imitation learning methods to be applicable in real-world applicati...
RJCIA 2022National audienceDeep Reinforcement Learning methods require a large amount of data to ach...