Learning classical planning strategies with policy gradient

Gomoluch, P
Alrajeh, D
Russo, A

Publication date

February 2019

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Journal

issn:2334-0843

Language

English

Abstract

A common paradigm in classical planning is heuristic forward search. Forward search planners often rely on a simple best-first search, which remains fixed throughout the search process. In this paper, we introduce a novel search framework capable of alternating between several forward search approaches while solving a particular planning problem. Selection of the approach is performed using a trainable stochastic policy, mapping the state of the search to a probability distribution over the approaches. This enables using policy gradient to learn search strategies tailored to specific distributions of planning problems and a selected performance metric, e.g. the IPC score. We instantiate the framework by constructing a policy space consisting...
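
The selection mechanism described in the abstract can be illustrated with a minimal sketch. It assumes a linear softmax policy over a hypothetical feature vector describing the search state, and a REINFORCE-style update driven by a single scalar reward per planning episode (for example, a score-like metric). The class name, the feature encoding, the number of approaches, and the learning rate are all illustrative assumptions and are not taken from the paper.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class SoftmaxPolicy:
    """Hypothetical linear softmax policy.

    Maps search-state features to a probability distribution over
    search approaches (identified here only by index).
    """

    def __init__(self, n_features, n_approaches):
        # One weight row per approach, initialised to zero (uniform policy).
        self.weights = [[0.0] * n_features for _ in range(n_approaches)]

    def probabilities(self, features):
        scores = [sum(w * f for w, f in zip(row, features))
                  for row in self.weights]
        return softmax(scores)

    def sample(self, features, rng):
        """Draw an approach index according to the current distribution."""
        probs = self.probabilities(features)
        return rng.choices(range(len(probs)), weights=probs)[0]

    def reinforce_update(self, episode, reward, learning_rate=0.1):
        """One REINFORCE step.

        episode: list of (features, chosen_approach) pairs.
        reward: scalar return for the whole episode (assumed metric).
        """
        grads = [[0.0] * len(row) for row in self.weights]
        # Accumulate the log-likelihood gradient at the current parameters.
        for features, chosen in episode:
            probs = self.probabilities(features)
            for a in range(len(self.weights)):
                indicator = 1.0 if a == chosen else 0.0
                for i, f in enumerate(features):
                    grads[a][i] += (indicator - probs[a]) * f
        # Gradient ascent on expected reward.
        for a, row in enumerate(self.weights):
            for i in range(len(row)):
                row[i] += learning_rate * reward * grads[a][i]


if __name__ == "__main__":
    rng = random.Random(0)
    policy = SoftmaxPolicy(n_features=2, n_approaches=3)
    state_features = [1.0, 0.5]  # hypothetical search-state features
    choice = policy.sample(state_features, rng)
    policy.reinforce_update([(state_features, choice)], reward=1.0)
    print(policy.probabilities(state_features))
```

In this sketch a positive reward raises the probability of the approaches that were chosen during the episode, and a negative reward lowers it. A practical setup would typically also subtract a baseline from the reward to reduce variance; that is omitted here for brevity.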
