This paper proposes model-free imitation learning named Entropy-Regularized Imitation Learning (ERIL) that minimizes the reverse Kullback-Leibler (KL) divergence. ERIL combines forward and inverse reinforcement learning (RL) under the framework of an entropy-regularized Markov decision process. An inverse RL step computes the log-ratio between two distributions by evaluating two binary discriminators. The first discriminator distinguishes the state generated by the forward RL step from the expert's state. The second discriminator, which is structured by the theory of entropy regularization, distinguishes the state-action-next-state tuples generated by the learner from the expert ones. One notable feature is that the second discriminator sha...
The goal of the inverse reinforcement learning (IRL) problem is to recover the reward functions from...
We develop a simple framework to learn bio-inspired foraging policies using human data. We conduct a...
A major challenge faced by machine learning community is the decision making problems under uncertai...
This paper proposes model-free imitation learning named Entropy-Regularized Imitation Learning (ERIL...
We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only...
Various methods for solving the inverse reinforcement learning (IRL) problem have been developed ind...
We consider the problem of imitation learning where the examples, demonstrated by an expert, cover o...
Inverse Reinforcement Learning (IRL) is attractive in scenarios where reward engineering can be tedi...
In many sequential decision-making problems (e.g., robotics control, game playing, sequential predic...
© 2020, Springer Science+Business Media, LLC, part of Springer Nature. Inverse reinforcement learnin...
Robots had a great impact on the manufacturing industry ever since the early seventies when companie...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from a policy pi. To...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
Inverse reinforcement learning (IRL) seeks to learn the reward function from expert trajectories, to...
The goal of the inverse reinforcement learning (IRL) problem is to recover the reward functions from...
We develop a simple framework to learn bio-inspired foraging policies using human data. We conduct a...
A major challenge faced by machine learning community is the decision making problems under uncertai...
This paper proposes model-free imitation learning named Entropy-Regularized Imitation Learning (ERIL...
We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only...
Various methods for solving the inverse reinforcement learning (IRL) problem have been developed ind...
We consider the problem of imitation learning where the examples, demonstrated by an expert, cover o...
Inverse Reinforcement Learning (IRL) is attractive in scenarios where reward engineering can be tedi...
In many sequential decision-making problems (e.g., robotics control, game playing, sequential predic...
© 2020, Springer Science+Business Media, LLC, part of Springer Nature. Inverse reinforcement learnin...
Robots had a great impact on the manufacturing industry ever since the early seventies when companie...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from a policy pi. To...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
Inverse reinforcement learning (IRL) seeks to learn the reward function from expert trajectories, to...
The goal of the inverse reinforcement learning (IRL) problem is to recover the reward functions from...
We develop a simple framework to learn bio-inspired foraging policies using human data. We conduct a...
A major challenge faced by machine learning community is the decision making problems under uncertai...