Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, longstanding challenges for AI. In this paper we consider how these challenges can be addressed within the mathematical framework of reinforcement learning and Markov decision processes (MDPs). We extend the usual notion of action in this framework to include options---closed-loop policies for taking action over a period of time. Examples of options include picking up an object, going to lunch, and traveling to a distant city, as well as primitive actions such as muscle twitches and joint torques. Overall, we show that options enable temporally abstract knowledge and action to be included in the reinforcement learning framework in a natural an...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Throughout this thesis, I develop the idea that the problem of learning good temporal abstractions i...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
Decision making usually involves choosing among different courses of action over a broad range of ti...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key ch...
The idea of temporal abstraction, i.e. learning, planning and representing the world at multiple tim...
Fundamental to reinforcement learning, as well as to the theory of systems and control, is the probl...
Temporally extended actions have been proved to enhance the performance of reinforcement learning ag...
AI planning benefits greatly from the use of temporally-extended or macro-actions. Macro-actions al...
Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While pla...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Throughout this thesis, I develop the idea that the problem of learning good temporal abstractions i...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
Decision making usually involves choosing among different courses of action over a broad range of ti...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key ch...
The idea of temporal abstraction, i.e. learning, planning and representing the world at multiple tim...
Fundamental to reinforcement learning, as well as to the theory of systems and control, is the probl...
Temporally extended actions have been proved to enhance the performance of reinforcement learning ag...
AI planning benefits greatly from the use of temporally-extended or macro-actions. Macro-actions al...
Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While pla...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Throughout this thesis, I develop the idea that the problem of learning good temporal abstractions i...
To operate effectively in complex environments learning agents require the ability to form useful ab...