Reinforcement learning (RL) has demonstrated its superiority in solving sequential decision-making problems. However, heavy dependence on immediate reward feedback impedes the wide application of RL. On the other hand, imitation learning (IL) tackles RL without relying on environmental supervision by leveraging external demonstrations. In practice, however, collecting sufficient expert demonstrations can be prohibitively expensive, yet the quality of demonstrations typically limits the performance of the learning policy. To address a practical scenario, in this work, we propose Self-Adaptive Imitation Learning (SAIL), which, provided with a few demonstrations from a sub-optimal teacher, can perform well in RL tasks with extremely delayed re...
International audienceImitation learning has emerged as a pragmatic alternative to reinforcement lea...
Reinforcement learning from expert demonstrations (RLED) is the intersection of imitation learning w...
Adversarial imitation learning has become a widely used imitation learning framework. The discrimina...
International audienceSelf-imitation learning is a Reinforcement Learning (RL) method that encourage...
Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection modul...
Imitation learning (IL) has recently shown impressive performance in training a reinforcement learni...
Imitation Learning (IL) is a popular approach for teaching behavior policies to agents by demonstrat...
Abstract. Imitation learning is an effective strategy to reinforcement learning, which avoids the de...
The promise of imitation is to facilitate learning by allowing the learner to ob-serve a teacher in ...
Imitation learning refers to a family of learning algorithms enabling the learning agents to learn d...
International audience—Learning from Demonstrations (LfD) is a paradigm by which an apprentice agent...
Sequential decisions and predictions are common problems in natural language processing, robotics, a...
Advances in robotics have resulted in increases both in the availability of robots and also their co...
We study how to effectively leverage expert feedback to learn sequential decision-making policies. W...
When the agent's observations or interactions are delayed, classic reinforcement learning tools usua...
International audienceImitation learning has emerged as a pragmatic alternative to reinforcement lea...
Reinforcement learning from expert demonstrations (RLED) is the intersection of imitation learning w...
Adversarial imitation learning has become a widely used imitation learning framework. The discrimina...
International audienceSelf-imitation learning is a Reinforcement Learning (RL) method that encourage...
Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection modul...
Imitation learning (IL) has recently shown impressive performance in training a reinforcement learni...
Imitation Learning (IL) is a popular approach for teaching behavior policies to agents by demonstrat...
Abstract. Imitation learning is an effective strategy to reinforcement learning, which avoids the de...
The promise of imitation is to facilitate learning by allowing the learner to ob-serve a teacher in ...
Imitation learning refers to a family of learning algorithms enabling the learning agents to learn d...
International audience—Learning from Demonstrations (LfD) is a paradigm by which an apprentice agent...
Sequential decisions and predictions are common problems in natural language processing, robotics, a...
Advances in robotics have resulted in increases both in the availability of robots and also their co...
We study how to effectively leverage expert feedback to learn sequential decision-making policies. W...
When the agent's observations or interactions are delayed, classic reinforcement learning tools usua...
International audienceImitation learning has emerged as a pragmatic alternative to reinforcement lea...
Reinforcement learning from expert demonstrations (RLED) is the intersection of imitation learning w...
Adversarial imitation learning has become a widely used imitation learning framework. The discrimina...