Many MDPs exhibit an hierarchical structure where the agent needs to perform various subtasks that are coupled only by a small sub-set of variables containing, notably, shared resources. Previous work has shown how this hierarchical structure can be exploited by solving several sub-MDPs representing the different sub-tasks in different calling contexts, and a root MDP responsible for sequencing and synchronizing the sub-tasks, instead of a huge MDP representing the whole problem. Another important idea used by efficient algorithms for solving flat MDPs, such as (L)AO * and (L)RTDP, is to exploit reachability information and an admissible heuristics in order to accelerate the search by pruning states that cannot be reached from a given start...
This paper presents an algorithm for finding ap-proximately optimal policies in very large Markov de...
Heuristic forward search is currently the dominant paradigm in classical planning. Forward search al...
Dynamic programming is a well-known approach for solving MDPs. In large state spaces, asynchronous v...
Recent algorithms like RTDP and LAO * combine the strength of Heuristic Search (HS) and Dynamic Prog...
We present a heuristic search algorithm for solving first-order MDPs (FOMDPs). Our approach combines...
In problem domains for which an informed admissible heuristic function is not available, one attract...
Forward-chaining heuristic search is a well-established and popular paradigm for domain-independent ...
Planning under uncertainty for multiagent systems can be formalized as a decentralized partially obs...
We propose a heuristic search algorithm for finding optimal policies in a new class of sequential de...
In problem domains where an informative heuristic evaluation function is not known or not easily com...
AbstractWe investigate the use of machine learning to create effective heuristics for search algorit...
The search for finite-state controllers for partially observable Markov decision processes (POMDPs) ...
International audienceDecentralized partially observable Markov decision processes (Dec-POMDPs) are ...
Dynamic programming is a well-known approach for solv-ing MDPs. In large state spaces, asynchronous ...
We present a novel heuristic search framework, called Multi-Heuristic A * (MHA*), that simultaneousl...
This paper presents an algorithm for finding ap-proximately optimal policies in very large Markov de...
Heuristic forward search is currently the dominant paradigm in classical planning. Forward search al...
Dynamic programming is a well-known approach for solving MDPs. In large state spaces, asynchronous v...
Recent algorithms like RTDP and LAO * combine the strength of Heuristic Search (HS) and Dynamic Prog...
We present a heuristic search algorithm for solving first-order MDPs (FOMDPs). Our approach combines...
In problem domains for which an informed admissible heuristic function is not available, one attract...
Forward-chaining heuristic search is a well-established and popular paradigm for domain-independent ...
Planning under uncertainty for multiagent systems can be formalized as a decentralized partially obs...
We propose a heuristic search algorithm for finding optimal policies in a new class of sequential de...
In problem domains where an informative heuristic evaluation function is not known or not easily com...
AbstractWe investigate the use of machine learning to create effective heuristics for search algorit...
The search for finite-state controllers for partially observable Markov decision processes (POMDPs) ...
International audienceDecentralized partially observable Markov decision processes (Dec-POMDPs) are ...
Dynamic programming is a well-known approach for solv-ing MDPs. In large state spaces, asynchronous ...
We present a novel heuristic search framework, called Multi-Heuristic A * (MHA*), that simultaneousl...
This paper presents an algorithm for finding ap-proximately optimal policies in very large Markov de...
Heuristic forward search is currently the dominant paradigm in classical planning. Forward search al...
Dynamic programming is a well-known approach for solving MDPs. In large state spaces, asynchronous v...