Two steps reinforcement learning

Fernández, Fernando
Borrajo Millán, Daniel

Publication date: January 2008

DOI: 10.1002/int.20255

Publisher: Wiley Periodicals

ISSN: 0884-8173

Abstract

When reinforcement learning is applied in domains with very large or continuous state spaces, the experience the learning agent obtains by interacting with the environment must be generalized. Generalization methods are usually based on approximating the value functions used to compute the action policy, and this is tackled in two different ways: either by approximating the value functions with a supervised learning method, or by discretizing the environment so that a tabular representation of the value functions can be used. In this work, we propose an algorithm that combines both approaches to obtain the benefits of both mechanisms, allowing higher performance. The approach is based on two learning phases. I...
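To make the second of the two generalization routes concrete, the following is a minimal sketch of discretizing a continuous state so that a tabular value function can be used. It is not the algorithm proposed in the paper: the uniform binning, the standard Q-learning update, and all names and parameter values (`discretize`, `q_update`, `n_bins`, `alpha`, `gamma`) are assumptions chosen only for illustration.

```python
def discretize(x, low, high, n_bins):
    """Map a continuous value x to a bin index in 0..n_bins-1 (uniform bins, assumed)."""
    x = min(max(x, low), high)               # clamp to the assumed state range
    idx = int((x - low) / (high - low) * n_bins)
    return min(idx, n_bins - 1)              # x == high belongs to the last bin


def q_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One standard tabular Q-learning update on q[state][action]."""
    target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])
    return q[s][a]


# Hypothetical one-dimensional state in [0, 1] with two actions.
n_bins, n_actions = 10, 2
q_table = [[0.0] * n_actions for _ in range(n_bins)]

s = discretize(0.37, 0.0, 1.0, n_bins)        # continuous state -> table row
s_next = discretize(0.42, 0.0, 1.0, n_bins)
new_value = q_update(q_table, s, a=1, r=1.0, s_next=s_next)
```

The design trade-off in this kind of sketch is the bin count: too few bins merge states that need different actions, while too many slow learning because each table entry is visited rarely.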
