We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortest Path problems, which are a natural model for planning under uncertainty for resource-bounded agents with multiple competing objectives. While unconstrained SSPs enjoy a multitude of efficient heuristic search solution methods with the ability to focus on promising areas reachable from the initial state, the state of the art for constrained SSPs revolves around linear and dynamic programming algorithms which explore the entire state space. In this paper, we present i-dual, which, to the best of our knowledge, is the first heuristic search algorithm for constrained SSPs. To concisely represent constraints and efficiently decide their violatio...
We consider the stochastic shortest path (SSP)problem for succinct Markov decision processes(MDPs), ...
Research in efficient methods for solving infinite-horizon MDPs has so far concentrated primarily on...
The knapsack problem (KP) is concerned with the selection of a subset of multiple items with known p...
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortes...
Real-world decision problems often involve multiple competing objectives. The Stochastic Shortest P...
For the past 25 years, heuristic search has been used to solve domain-independent probabilistic plan...
Stochastic shortest-path problems (SSP) are an important subclass of MDPs for which heuristic search...
Heuristic search is a powerful approach that has successfully been applied to a broad class of plann...
Stochastic Shortest Path Problems (SSPs) are a common representation for probabilistic planning prob...
We consider recently-derived error bounds that can be used to bound the quality of solutions found b...
Fully observable decision-theoretic planning problems are commonly modeled as stochastic shortest pa...
The Partially Observable Markov Decision Process (POMDP) is widely used in probabilistic planning fo...
Stochastic Shortest Path problems (SSPs) can be efficiently dealt with by the Real-Time Dynamic Prog...
Stochastic Shortest Path problems (SSPs) can be efficiently dealt with by the Real-Time Dynamic Prog...
In this paper, we consider planning in stochastic shortest path problems, a subclass of Markov Decis...
We consider the stochastic shortest path (SSP)problem for succinct Markov decision processes(MDPs), ...
Research in efficient methods for solving infinite-horizon MDPs has so far concentrated primarily on...
The knapsack problem (KP) is concerned with the selection of a subset of multiple items with known p...
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortes...
Real-world decision problems often involve multiple competing objectives. The Stochastic Shortest P...
For the past 25 years, heuristic search has been used to solve domain-independent probabilistic plan...
Stochastic shortest-path problems (SSP) are an important subclass of MDPs for which heuristic search...
Heuristic search is a powerful approach that has successfully been applied to a broad class of plann...
Stochastic Shortest Path Problems (SSPs) are a common representation for probabilistic planning prob...
We consider recently-derived error bounds that can be used to bound the quality of solutions found b...
Fully observable decision-theoretic planning problems are commonly modeled as stochastic shortest pa...
The Partially Observable Markov Decision Process (POMDP) is widely used in probabilistic planning fo...
Stochastic Shortest Path problems (SSPs) can be efficiently dealt with by the Real-Time Dynamic Prog...
Stochastic Shortest Path problems (SSPs) can be efficiently dealt with by the Real-Time Dynamic Prog...
In this paper, we consider planning in stochastic shortest path problems, a subclass of Markov Decis...
We consider the stochastic shortest path (SSP)problem for succinct Markov decision processes(MDPs), ...
Research in efficient methods for solving infinite-horizon MDPs has so far concentrated primarily on...
The knapsack problem (KP) is concerned with the selection of a subset of multiple items with known p...