Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning

Sutton, Richard S.
Precup, Doina
Singh, Satinder

Publication date

August 1999

DOI

10.1016/S0004-3702(99)00052-1

Publisher

Published by Elsevier B.V.

Abstract

Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, longstanding challenges for AI. In this paper we consider how these challenges can be addressed within the mathematical framework of reinforcement learning and Markov decision processes (MDPs). We extend the usual notion of action in this framework to include options—closed-loop policies for taking action over a period of time. Examples of options include picking up an object, going to lunch, and traveling to a distant city, as well as primitive actions such as muscle twitches and joint torques. Overall, we show that options enable temporally abstract knowledge and action to be included in the reinforcement learning framework in a natu...
