Recent machine-learning approaches to deterministic search and domain-independent planning employ policy learning to speed up search. Unfortunately, when attempting to solve a search problem by successively applying a policy, no guarantees can be given on solution quality. The problem of how to effectively use a learned policy within a bounded-suboptimal search algorithm remains largely as an open question. In this paper, we propose various ways in which such policies can be integrated into Focal Search, assuming that the policy is a neural network classifier. Furthermore, we provide mathematical foundations for some of the resulting algorithms. To evaluate the resulting algorithms over a number of policies with varying accuracy, we use s...
Work in machine learning has grown tremendously in the past years, but has had little to no impact o...
Heuristic forward search is currently the dominant paradigmin classical planning. Forward search alg...
We consider the policy search approach to reinforcement learning. We show that if a “baseline distri...
Machine Learning (ML) has made significant progress to perform different tasks, such as image classi...
Heuristic forward search is currently the dominant paradigm in classical planning. Forward search al...
Direct policy search methods offer the promise of automatically learning controllers for com-plex, h...
Many current state-of-the-art planners rely on forward heuristic search. The success of such search ...
Graduation date: 2011This dissertation explores algorithms for learning ranking functions to efficie...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
A major difficulty in a search-based problem-solving process is the task of searching the potentiall...
Heuristic search algorithms are widely used in both AI planning and the decoding of sequences from d...
A popular approach for online decision making in large MDPs is time-bounded tree search. The effecti...
Abstract—We consider the problem of learning to locate targets from demonstrated searches. In this c...
Feature Selection techniques usually follow some search strategy to select a suitable subset from a ...
We consider a novel use of mostly-correct reactive policies. In classical planning, reactive policy ...
Work in machine learning has grown tremendously in the past years, but has had little to no impact o...
Heuristic forward search is currently the dominant paradigmin classical planning. Forward search alg...
We consider the policy search approach to reinforcement learning. We show that if a “baseline distri...
Machine Learning (ML) has made significant progress to perform different tasks, such as image classi...
Heuristic forward search is currently the dominant paradigm in classical planning. Forward search al...
Direct policy search methods offer the promise of automatically learning controllers for com-plex, h...
Many current state-of-the-art planners rely on forward heuristic search. The success of such search ...
Graduation date: 2011This dissertation explores algorithms for learning ranking functions to efficie...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
A major difficulty in a search-based problem-solving process is the task of searching the potentiall...
Heuristic search algorithms are widely used in both AI planning and the decoding of sequences from d...
A popular approach for online decision making in large MDPs is time-bounded tree search. The effecti...
Abstract—We consider the problem of learning to locate targets from demonstrated searches. In this c...
Feature Selection techniques usually follow some search strategy to select a suitable subset from a ...
We consider a novel use of mostly-correct reactive policies. In classical planning, reactive policy ...
Work in machine learning has grown tremendously in the past years, but has had little to no impact o...
Heuristic forward search is currently the dominant paradigmin classical planning. Forward search alg...
We consider the policy search approach to reinforcement learning. We show that if a “baseline distri...