University of Technology Sydney. Faculty of Engineering and Information Technology.A promising method of learning from human feedback is reward shaping, where a robot is trained via human-delivered instantaneous rewards. The existing approach, which requires numerous reward signals about the quality of agent’s actions from the human trainer, is based on a number of assumptions about human capabilities. For example, it assumes that humans can provide a precisely correct feedback to an agent’s action, or that they would always prefer to train an agent by means of reward signals, or that they can assess an agent’s actions for any length of training. In this thesis, we have relaxed these assumptions and have addressed two important issues whic...
For agents and robots to become more useful, they must be able to quickly learn from non-technical u...
A long term goal of Interactive Reinforcement Learning is to incorporate non-expert human feedback t...
In this work, we address a relatively unexplored aspect of designing agents that learn from human re...
textRobots and other computational agents are increasingly becoming part of our daily lives. They wi...
While reinforcement learning has led to promising results in robotics, defining an informative rewar...
Thesis (Ph.D.)--University of Washington, 2021Rapid strides made in the development of computing inf...
AbstractWhile Reinforcement Learning (RL) is not traditionally designed for interactive supervisory ...
An important goal in artificial intelligence is to create agents that can both interact naturally wi...
Abstract — In order to be useful in real-world situations, it is critical to allow non-technical use...
Recent successes combine reinforcement learning algorithms and deep neural networks, despite reinfor...
As robots become a mass consumer product, they will need to learn new skills by interacting with typ...
Keeping a human in a robot learning cycle can provide many advantages to improve the learning proces...
Off-the-shelf Reinforcement Learning (RL) algorithms suffer from slow learning performance, partly b...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
In settings without well-defined goals, methods for reward learning allow reinforcement learning age...
For agents and robots to become more useful, they must be able to quickly learn from non-technical u...
A long term goal of Interactive Reinforcement Learning is to incorporate non-expert human feedback t...
In this work, we address a relatively unexplored aspect of designing agents that learn from human re...
textRobots and other computational agents are increasingly becoming part of our daily lives. They wi...
While reinforcement learning has led to promising results in robotics, defining an informative rewar...
Thesis (Ph.D.)--University of Washington, 2021Rapid strides made in the development of computing inf...
AbstractWhile Reinforcement Learning (RL) is not traditionally designed for interactive supervisory ...
An important goal in artificial intelligence is to create agents that can both interact naturally wi...
Abstract — In order to be useful in real-world situations, it is critical to allow non-technical use...
Recent successes combine reinforcement learning algorithms and deep neural networks, despite reinfor...
As robots become a mass consumer product, they will need to learn new skills by interacting with typ...
Keeping a human in a robot learning cycle can provide many advantages to improve the learning proces...
Off-the-shelf Reinforcement Learning (RL) algorithms suffer from slow learning performance, partly b...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
In settings without well-defined goals, methods for reward learning allow reinforcement learning age...
For agents and robots to become more useful, they must be able to quickly learn from non-technical u...
A long term goal of Interactive Reinforcement Learning is to incorporate non-expert human feedback t...
In this work, we address a relatively unexplored aspect of designing agents that learn from human re...