We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contained transition and reward models suitable for probabilistic planning at multiple levels of abstraction. We call this framework Planning with Abstract Learned Models (PALM). By representing subtasks symbolically using a new formal structure, the lifted abstract Markov decision process (L-AMDP), PALM learns models that are independent and modular. Through our experiments, we show how PALM integrates planning and execution, facilitating a rapid and efficient learning of abstract, hierarchical models. We also demonstrate the increased potential for learned models to be transferred to new and related tasks
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
This thesis is concerned with the problem of how to make decisions in an uncertain world. We use a ...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
We propose a model-based approach to hierarchical reinforcement learning that exploits shared knowle...
In the quest for efficient and robust reinforcement learning methods, both model-free and model-base...
Two common approaches to sequential decision-making are AI planning (AIP) and reinforcement learning...
This paper presents a new method for the autonomous construction of hierarchical action and state re...
Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a k...
In this paper we present a hybrid system combining techniques from symbolic planning and reinforceme...
Abstract. Reinforcement learning is bedeviled by the curse of dimensionality: the number of paramete...
To operate effectively in complex environments learning agents require the ability to form useful ab...
This thesis addresses the open problem of automatically discovering hierarchical structure in reinfo...
Common approaches to Reinforcement Learning (RL) are seriously challenged by large-scale application...
Deep reinforcement learning methods are capable of learning complex heuristics starting with no prio...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
This thesis is concerned with the problem of how to make decisions in an uncertain world. We use a ...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
We propose a model-based approach to hierarchical reinforcement learning that exploits shared knowle...
In the quest for efficient and robust reinforcement learning methods, both model-free and model-base...
Two common approaches to sequential decision-making are AI planning (AIP) and reinforcement learning...
This paper presents a new method for the autonomous construction of hierarchical action and state re...
Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a k...
In this paper we present a hybrid system combining techniques from symbolic planning and reinforceme...
Abstract. Reinforcement learning is bedeviled by the curse of dimensionality: the number of paramete...
To operate effectively in complex environments learning agents require the ability to form useful ab...
This thesis addresses the open problem of automatically discovering hierarchical structure in reinfo...
Common approaches to Reinforcement Learning (RL) are seriously challenged by large-scale application...
Deep reinforcement learning methods are capable of learning complex heuristics starting with no prio...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
This thesis is concerned with the problem of how to make decisions in an uncertain world. We use a ...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...