Interactive reinforcement learning can effectively facilitate the agent training via human feedback. However, such methods often require the human teacher to know what is the correct action that the agent should take. In other words, if the human teacher is not always reliable, then it will not be consistently able to guide the agent through its training. In this paper, we propose a more effective interactive reinforcement learning system by introducing multiple trainers, namely Multi-Trainer Interactive Reinforcement Learning (MTIRL), which could aggregate the binary feedback from multiple non-perfect trainers into a more reliable reward for an agent training in a reward-sparse environment. In particular, our trainer feedback aggregation e...
AbstractA major concern in multi-agent coordination is how to select algorithms that can lead agents...
Learning from rewards generated by a human trainer observing an agent in action has been proven to b...
The main contributions in this thesis include the selectively decentralized method in solving multi-...
Interactive reinforcement learning has become an important apprenticeship approach to speed up conve...
A long term goal of Interactive Reinforcement Learning is to incorporate non-expert human feedback t...
Machine learning and artificial intelligence has been a hot topic the last few years, thanks to impr...
Abstract Multi-agent systems are rapidly nding applications in a variety of domains, including robo...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
The ability to learn new tasks by sequencing already known skills is an important requirement for fu...
This paper introduces two novel algorithms for learning behaviors from human-provided rewards. The p...
Multiagent systems are rapidly finding applications in a variety of domains, including robotics, dis...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
For many problems which would be natural for reinforcement learning, the reward signal is not a sing...
The TAMER framework, which provides a way for agents to learn to solve tasks using human-generated r...
The book begins with a chapter on traditional methods of supervised learning, covering recursive lea...
AbstractA major concern in multi-agent coordination is how to select algorithms that can lead agents...
Learning from rewards generated by a human trainer observing an agent in action has been proven to b...
The main contributions in this thesis include the selectively decentralized method in solving multi-...
Interactive reinforcement learning has become an important apprenticeship approach to speed up conve...
A long term goal of Interactive Reinforcement Learning is to incorporate non-expert human feedback t...
Machine learning and artificial intelligence has been a hot topic the last few years, thanks to impr...
Abstract Multi-agent systems are rapidly nding applications in a variety of domains, including robo...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
The ability to learn new tasks by sequencing already known skills is an important requirement for fu...
This paper introduces two novel algorithms for learning behaviors from human-provided rewards. The p...
Multiagent systems are rapidly finding applications in a variety of domains, including robotics, dis...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
For many problems which would be natural for reinforcement learning, the reward signal is not a sing...
The TAMER framework, which provides a way for agents to learn to solve tasks using human-generated r...
The book begins with a chapter on traditional methods of supervised learning, covering recursive lea...
AbstractA major concern in multi-agent coordination is how to select algorithms that can lead agents...
Learning from rewards generated by a human trainer observing an agent in action has been proven to b...
The main contributions in this thesis include the selectively decentralized method in solving multi-...