Hierarchical Average Reward Reinforcement Learning

Mohammad Ghavamzadeh
Sridhar Mahadevan

Publication date

January 2003

DOI

Abstract

Hierarchical reinforcement learning (HRL) is the study of mechanisms for exploiting the structure of tasks in order to learn more quickly. By decomposing tasks into subtasks, fully or partially specified subtask solutions can be reused in solving tasks at higher levels of abstraction. The theory of semi-Markov decision processes provides a theoretical basis for HRL. Several variant representational schemes based on SMDP models have been studied in previous work, all of which are based on the discrete-time discounted SMDP model. In this approach, policies are learned that maximize the long-term discounted sum of rewards. In this paper we investigate two formulations of HRL based on the average-reward SMDP model, both for discrete time and co...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Hierarchical Average Reward Reinforcement Learning

Abstract

Extracted data

Hierarchical Average Reward Reinforcement Learning

Abstract

Extracted data

Related items

Related items