Real-world decision problems often involve multiple competing objectives. The Stochastic Shortest Path (SSP) with lexicographic preferences over multiple costs offers an expressive formulation for many practical problems. However, the existing solution methods either lack optimality guarantees or require costly computations over the entire state space. We propose the first heuristic algorithm for this problem, based on the heuristic algorithm for Constrained SSPs. Our experiments show that our heuristic search algorithm can compute optimal policies while avoiding a large portion of the state space. We further analyze the theoretical properties of the problem, showing the conditions under which SSPs with lexicographic preferences have a ...
Sequential decision problems for real-world applications often need to be solved in real-time, requi...
Stochastic Shortest Path problems (SSPs) can be efficiently dealt with by the Real-Time Dynamic Prog...
International audienceWe consider the objective of computing an ε-optimal policy in a stochastic sho...
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortes...
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortes...
Heuristic search is a powerful approach that has successfully been applied to a broad class of plann...
Stochastic shortest-path problems (SSP) are an important subclass of MDPs for which heuristic search...
Fully observable decision-theoretic planning problems are commonly modeled as stochastic shortest pa...
Shortest Path Problems (SPP) are one of the most extensively studied problems in the fields of Artif...
Research in efficient methods for solving infinite-horizon MDPs has so far concentrated primarily on...
We consider recently-derived error bounds that can be used to bound the quality of solutions found b...
Stochastic Shortest Path Problems (SSPs) are a common representation for probabilistic planning prob...
We consider the stochastic shortest path (SSP)problem for succinct Markov decision processes(MDPs), ...
Comunicació presentada a: ICAPS 2011 celebrat de l'11 al 16 de juny de 2011 a Freiburg, Alemanya.Res...
Heuristic search is used to efficiently solve the single-node shortest path problem in weighted gra...
Sequential decision problems for real-world applications often need to be solved in real-time, requi...
Stochastic Shortest Path problems (SSPs) can be efficiently dealt with by the Real-Time Dynamic Prog...
International audienceWe consider the objective of computing an ε-optimal policy in a stochastic sho...
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortes...
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortes...
Heuristic search is a powerful approach that has successfully been applied to a broad class of plann...
Stochastic shortest-path problems (SSP) are an important subclass of MDPs for which heuristic search...
Fully observable decision-theoretic planning problems are commonly modeled as stochastic shortest pa...
Shortest Path Problems (SPP) are one of the most extensively studied problems in the fields of Artif...
Research in efficient methods for solving infinite-horizon MDPs has so far concentrated primarily on...
We consider recently-derived error bounds that can be used to bound the quality of solutions found b...
Stochastic Shortest Path Problems (SSPs) are a common representation for probabilistic planning prob...
We consider the stochastic shortest path (SSP)problem for succinct Markov decision processes(MDPs), ...
Comunicació presentada a: ICAPS 2011 celebrat de l'11 al 16 de juny de 2011 a Freiburg, Alemanya.Res...
Heuristic search is used to efficiently solve the single-node shortest path problem in weighted gra...
Sequential decision problems for real-world applications often need to be solved in real-time, requi...
Stochastic Shortest Path problems (SSPs) can be efficiently dealt with by the Real-Time Dynamic Prog...
International audienceWe consider the objective of computing an ε-optimal policy in a stochastic sho...