In this paper we present a hybrid system combining techniques from symbolic planning and reinforcement learning. Planning is used to automatically construct task hierarchies for hierarchical models of the behaviours ’ purpose, and to perform intelligent termination improvement when an executing behaviour is no longer appropriate. Reinforcement learning is used to produce concrete implementations of abstractly defined behaviours and to learn the best possible choice of behaviour when plans are ambiguous. Two new hierarchical reinforcement learning algorithms are presented: Planned Hierarchical Semi-Markov Q-Learning (P-HSMQ), a variant of the HSMQ algorithm (Dietterich, 2000b) which uses plan-built task hierarchies, and Teleo-Reactive Q-Lear...
The graphical models paradigm provides a general framework for automatically learning hierarchical m...
Long-horizon manipulation tasks such as stacking represent a longstanding challenge in the field of ...
Many environments involve following rules and tasks; for example, a chef cooking a dish follows a re...
In this thesis we investigate the relationships between the symbolic and sub-symbolic methods used f...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
There is an increasing interest in Reinforcement Learning to solve new and more challenging problems...
Reinforcement learning provides a means for autonomous agents to improve their action selection stra...
This thesis addresses the open problem of automatically discovering hierarchical structure in reinfo...
Two common approaches to sequential decision-making are AI planning (AIP) and reinforcement learning...
This paper presents a novel model of reinforcement learning agents. A feature of our learning agent ...
A hierarchical representation of the input-output transition function in a learning system is sugges...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
We consider how to learn Hierarchical Task Networks (HTNs) for planning problems in which both the q...
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decompos...
We propose a model-based approach to hierarchical reinforcement learning that exploits shared knowle...
The graphical models paradigm provides a general framework for automatically learning hierarchical m...
Long-horizon manipulation tasks such as stacking represent a longstanding challenge in the field of ...
Many environments involve following rules and tasks; for example, a chef cooking a dish follows a re...
In this thesis we investigate the relationships between the symbolic and sub-symbolic methods used f...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
There is an increasing interest in Reinforcement Learning to solve new and more challenging problems...
Reinforcement learning provides a means for autonomous agents to improve their action selection stra...
This thesis addresses the open problem of automatically discovering hierarchical structure in reinfo...
Two common approaches to sequential decision-making are AI planning (AIP) and reinforcement learning...
This paper presents a novel model of reinforcement learning agents. A feature of our learning agent ...
A hierarchical representation of the input-output transition function in a learning system is sugges...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
We consider how to learn Hierarchical Task Networks (HTNs) for planning problems in which both the q...
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decompos...
We propose a model-based approach to hierarchical reinforcement learning that exploits shared knowle...
The graphical models paradigm provides a general framework for automatically learning hierarchical m...
Long-horizon manipulation tasks such as stacking represent a longstanding challenge in the field of ...
Many environments involve following rules and tasks; for example, a chef cooking a dish follows a re...