The TAMER framework, which provides a way for agents to learn to solve tasks using human-generated rewards, has been examined in several small-scale studies, each with a few dozen subjects. In this paper, we present the results of the first large-scale study of TAMER, which was performed at the NEMO science museum in Amsterdam and involved 561 subjects. Our results show for the first time that an agent using TAMER can successfully learn to play Infinite Mario, a challenging reinforcement-learning benchmark problem based on the popular video game, given feedback from both adult (N=209) and child (N=352) trainers. In addition, our study supports prior studies demonstrating the importance of bidirectional feedback and competitive elements in t...
We introduce a new reward function direction for intrinsically motivated reinforcement learning to m...
Abstract — In order to be useful in real-world situations, it is critical to allow non-technical use...
Incorporating human interaction into agent learning yields two crucial benefits. First, human knowle...
The TAMER framework, which provides a way for agents to learn to solve tasks using human-generated r...
Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative...
Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative...
Learning from rewards generated by a human trainer observing an agent in action has proven to be a p...
Learning from rewards generated by a human trainer observing an agent in action has been proven to b...
In this work, we address a relatively unexplored aspect of designing agents that learn from human re...
In this paper, we address a relatively unexplored aspect of designing agents that learn from human t...
In this paper, we address a relatively unexplored aspect of designing agents that learn from human t...
In this work, we address a relatively unexplored aspect of designing agents that learn from human re...
textRobots and other computational agents are increasingly becoming part of our daily lives. They wi...
While recent advances in deep reinforcement learning have allowed autonomous learning agents to succ...
Cooperation is a widespread phenomenon in nature that has also been a cornerstone in the development...
We introduce a new reward function direction for intrinsically motivated reinforcement learning to m...
Abstract — In order to be useful in real-world situations, it is critical to allow non-technical use...
Incorporating human interaction into agent learning yields two crucial benefits. First, human knowle...
The TAMER framework, which provides a way for agents to learn to solve tasks using human-generated r...
Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative...
Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative...
Learning from rewards generated by a human trainer observing an agent in action has proven to be a p...
Learning from rewards generated by a human trainer observing an agent in action has been proven to b...
In this work, we address a relatively unexplored aspect of designing agents that learn from human re...
In this paper, we address a relatively unexplored aspect of designing agents that learn from human t...
In this paper, we address a relatively unexplored aspect of designing agents that learn from human t...
In this work, we address a relatively unexplored aspect of designing agents that learn from human re...
textRobots and other computational agents are increasingly becoming part of our daily lives. They wi...
While recent advances in deep reinforcement learning have allowed autonomous learning agents to succ...
Cooperation is a widespread phenomenon in nature that has also been a cornerstone in the development...
We introduce a new reward function direction for intrinsically motivated reinforcement learning to m...
Abstract — In order to be useful in real-world situations, it is critical to allow non-technical use...
Incorporating human interaction into agent learning yields two crucial benefits. First, human knowle...