Abstract — Learning policies that generalize across multiple tasks is an important and challenging research topic in rein-forcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, requiring more principled ap-proaches to share and transfer knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in r...
In this thesis, we propose a method titled "Task Space Policy Learning (TaSPL)", a novel technique t...
Multirobot domains are a challenge for learning algorithms because they require robots to learn to c...
Policy learning which allows autonomous robots to adapt to novel situations has been a long standing...
Abstract — Learning policies that generalize across multiple tasks is an important and challenging r...
Abstract—Reinforcement learning enables an agent to learn behavior by acquiring experience through t...
Policy gradient algorithms have shown consider-able recent success in solving high-dimensional seque...
Deep imitation learning requires many expert demonstrations, which can be hard to obtain, especially...
Humans utilise a large diversity of control and reasoning methods to solve different robot manipula...
Many robotic tasks are composed of a lot of temporally correlated sub-tasks in a highly complex envi...
Robots are on the verge of becoming ubiquitous. In the form of affordable humanoid toy robots, auton...
Reinforcement learning algorithms are very effective at learning policies (mappings from states to a...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
Some imitation learning approaches rely on Inverse Reinforcement Learning (IRL) methods, to decode a...
Multi-task policies enable a user to adjust their desired objective or task parameters withouthaving...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
In this thesis, we propose a method titled "Task Space Policy Learning (TaSPL)", a novel technique t...
Multirobot domains are a challenge for learning algorithms because they require robots to learn to c...
Policy learning which allows autonomous robots to adapt to novel situations has been a long standing...
Abstract — Learning policies that generalize across multiple tasks is an important and challenging r...
Abstract—Reinforcement learning enables an agent to learn behavior by acquiring experience through t...
Policy gradient algorithms have shown consider-able recent success in solving high-dimensional seque...
Deep imitation learning requires many expert demonstrations, which can be hard to obtain, especially...
Humans utilise a large diversity of control and reasoning methods to solve different robot manipula...
Many robotic tasks are composed of a lot of temporally correlated sub-tasks in a highly complex envi...
Robots are on the verge of becoming ubiquitous. In the form of affordable humanoid toy robots, auton...
Reinforcement learning algorithms are very effective at learning policies (mappings from states to a...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
Some imitation learning approaches rely on Inverse Reinforcement Learning (IRL) methods, to decode a...
Multi-task policies enable a user to adjust their desired objective or task parameters withouthaving...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
In this thesis, we propose a method titled "Task Space Policy Learning (TaSPL)", a novel technique t...
Multirobot domains are a challenge for learning algorithms because they require robots to learn to c...
Policy learning which allows autonomous robots to adapt to novel situations has been a long standing...