Approximate Policy Iteration with a Policy Language Bias: Learning Control Knowledge in Planning Domains

Fern, Alan
Yoon, Sungwook
Givan, Robert

Publication date

May 2003

Publisher

Purdue University (bepress)

Language

English

Abstract

We design a novel approximate policy iteration (API) method suited for learning good domain-specific control knowledge in large relational planning domains. The learned knowledge takes the form of a control policy for a single Markov decision process representing all problem instances of the planning domain. Our learned policies can quickly solve most or all problems within the domains we evaluate. The API methods we adapt move from policy to policy using a combination of policy simulation and inductive policy selection. Previous methods represent policies implicitly, using cost functions combined with greedy look-ahead. We represent policies directly as compact state-action mappings, and thus avoid the often awkward problem of giving any c...
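The loop the abstract describes, policy simulation followed by inductive policy selection, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy chain domain, the function names (`rollout_cost`, `improved_training_set`, `learn_policy`), and the table-lookup learner are all assumptions made for illustration, and the relational policy language the report uses is replaced here by a plain state-to-action table.

```python
from collections import Counter

N = 6                          # hypothetical chain domain: states 0..N, goal at N
ACTIONS = ("left", "right")    # hypothetical action set


def step(state, action):
    """Deterministic toy dynamics, clipped to the ends of the chain."""
    if action == "right":
        return min(state + 1, N)
    return max(state - 1, 0)


def rollout_cost(state, first_action, policy, horizon):
    """Policy simulation: take first_action, then follow policy.

    Each step costs 1; the rollout stops at the goal or after horizon steps.
    """
    cost, s, action = 0, state, first_action
    for _ in range(horizon):
        if s == N:
            break
        s = step(s, action)
        cost += 1
        action = policy(s)
    return cost


def improved_training_set(policy, states, horizon):
    """Label each state with the action whose rollout under policy is cheapest."""
    data = []
    for s in states:
        costs = {a: rollout_cost(s, a, policy, horizon) for a in ACTIONS}
        data.append((s, min(ACTIONS, key=costs.get)))  # ties go to ACTIONS[0]
    return data


def learn_policy(data):
    """Stand-in for inductive policy selection.

    Builds a direct state-to-action mapping, with the majority action as the
    default for unseen states. A real learner would generalize across states.
    """
    table = dict(data)
    default = Counter(a for _, a in data).most_common(1)[0][0]
    return lambda s: table.get(s, default)


def approximate_policy_iteration(iterations, horizon=2 * N):
    """Alternate simulation and learning, starting from a poor policy."""
    policy = lambda s: "left"      # deliberately bad starting policy
    states = list(range(N))        # all non-goal states; a real system would sample
    for _ in range(iterations):
        data = improved_training_set(policy, states, horizon)
        policy = learn_policy(data)
    return policy


if __name__ == "__main__":
    final = approximate_policy_iteration(iterations=N)
    print([final(s) for s in range(N)])
```

In this toy setting each iteration extends the set of states that move toward the goal by one, so `N` iterations suffice. The policy is stored directly as a mapping from states to actions, and no cost function is ever learned; rollout costs are used only to label training states.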
