We design a novel approximate policy iteration (API) method suited for learning good domain-specific control knowledge in large relational planning domains. The learned knowledge takes the form of a control policy for a single Markov decision process representing all problem instances of the planning domain. Our learned policies can quickly solve most or all problems within the domains we evaluate. The API methods we adapt move from policy to policy using a combination of policy simulation and inductive policy selection. Previous methods represent policies implicitly, using cost functions combined with greedy look-ahead. We represent policies directly as compact state-action mappings, and thus avoid the often awkward problem of giving any c...
Generalized planning is concerned with the computation of general policies that solve multiple insta...
We consider a novel use of mostly-correct reactive policies. In classical planning, reactive policy ...
We introduce a variant of the classification-based approach to policy iteration which uses a cost-se...
We explore approximate policy iteration, replacing the usual costfunction learning step with a learn...
We describe and evaluate a system for learning domain-specific control knowledge. In particular, giv...
We consider techniques for learning to plan in deterministic and stochastic Artificial Intelligence ...
General-purpose generative planners use domain-independent search heuristics to generate solutions f...
General-purpose generative planners use domain-independent search heuristics to generate solutions f...
This work investigates the application of Evolutionary Computation (EC) to the induction of generali...
International audienceWe present a classification-based policy iteration algorithm, called Direct Po...
Current planners show impressive performance in many real world and artificial domains by using plan...
Abstract: "Intelligent problem solving requires the ability to select actions autonomously from a sp...
As any other problem solving task that employs search, AI Planning needs heuristics to efficiently g...
In complex, dynamic environments, an agent's knowledge of the environment (its domain knowledge) wil...
AbstractQ-Learning is based on value iteration and remains the most popular choice for solving Marko...
Generalized planning is concerned with the computation of general policies that solve multiple insta...
We consider a novel use of mostly-correct reactive policies. In classical planning, reactive policy ...
We introduce a variant of the classification-based approach to policy iteration which uses a cost-se...
We explore approximate policy iteration, replacing the usual costfunction learning step with a learn...
We describe and evaluate a system for learning domain-specific control knowledge. In particular, giv...
We consider techniques for learning to plan in deterministic and stochastic Artificial Intelligence ...
General-purpose generative planners use domain-independent search heuristics to generate solutions f...
General-purpose generative planners use domain-independent search heuristics to generate solutions f...
This work investigates the application of Evolutionary Computation (EC) to the induction of generali...
International audienceWe present a classification-based policy iteration algorithm, called Direct Po...
Current planners show impressive performance in many real world and artificial domains by using plan...
Abstract: "Intelligent problem solving requires the ability to select actions autonomously from a sp...
As any other problem solving task that employs search, AI Planning needs heuristics to efficiently g...
In complex, dynamic environments, an agent's knowledge of the environment (its domain knowledge) wil...
AbstractQ-Learning is based on value iteration and remains the most popular choice for solving Marko...
Generalized planning is concerned with the computation of general policies that solve multiple insta...
We consider a novel use of mostly-correct reactive policies. In classical planning, reactive policy ...
We introduce a variant of the classification-based approach to policy iteration which uses a cost-se...