Cooperative trajectory planning methods for automated vehicles can solve traffic scenarios that require a high degree of cooperation between traffic participants. However, for cooperative systems to integrate into human-centered traffic, the automated systems must behave human-like so that humans can anticipate the system's decisions. While Reinforcement Learning has made remarkable progress in solving the decision-making part, it is non-trivial to parameterize a reward model that yields predictable actions. This work employs feature-based Maximum Entropy Inverse Reinforcement Learning combined with Monte Carlo Tree Search to learn reward models that maximize the likelihood of recorded multi-agent cooperative expert trajectories. The evalua...
Abstract In a large variety of situations one would like to have an expressive and accurate model of...
Recent research has shown the benefit of framing problems of imitation learning as solutions to Mark...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
Modeling possible future outcomes of robot-human interactions is of importance in the intelligent ve...
This work handles the inverse reinforcement learning (IRL) problem where only a small number of demo...
We make decisions to maximize our perceived reward, but handcrafting a reward function for an autono...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
International audienceA popular approach to apprenticeship learning (AL) is to formulate itas an inv...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
One of the fundamental problems of artificial intelligence is learning how to behave optimally. With...
Existing inverse reinforcement learning (IRL) algorithms have assumed each expert’s demonstrated tra...
We study the problem of designing autonomous agents that can learn to cooperate effectively with a p...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
Autonomous unmanned vehicles (UxVs) can be useful in many scenarios including disaster relief, prod...
Abstract In a large variety of situations one would like to have an expressive and accurate model of...
Recent research has shown the benefit of framing problems of imitation learning as solutions to Mark...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
Modeling possible future outcomes of robot-human interactions is of importance in the intelligent ve...
This work handles the inverse reinforcement learning (IRL) problem where only a small number of demo...
We make decisions to maximize our perceived reward, but handcrafting a reward function for an autono...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
International audienceA popular approach to apprenticeship learning (AL) is to formulate itas an inv...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
One of the fundamental problems of artificial intelligence is learning how to behave optimally. With...
Existing inverse reinforcement learning (IRL) algorithms have assumed each expert’s demonstrated tra...
We study the problem of designing autonomous agents that can learn to cooperate effectively with a p...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
Autonomous unmanned vehicles (UxVs) can be useful in many scenarios including disaster relief, prod...
Abstract In a large variety of situations one would like to have an expressive and accurate model of...
Recent research has shown the benefit of framing problems of imitation learning as solutions to Mark...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...