We consider finite horizon Markov decision processes under performance measures that involve both the mean and the variance of the cumulative reward. We show that either randomized or history-based policies can im-prove performance. We prove that the com-plexity of computing a policy that maximizes the mean reward under a variance constraint is NP-hard for some cases, and strongly NP-hard for others. We finally offer pseudopoly-nomial exact and approximation algorithms. 1
AbstractWe study controller synthesis problems for finite-state Markov decision processes, where the...
What are the functionals of the reward that can be computed and optimized exactly in Markov Decision...
We study controller synthesis problems for finite-state Markov decision processes, where the objecti...
We consider finite horizon Markov decision processes under performance measures that involve both th...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We identify a rich class of finite-horizon Markov decision problems (MDPs) for which the variance of...
In this note, we consider discrete-time Markov decision processes with finite state space. Recalling...
We develop the asymptotic variance for Markov decision processes. Results are provided to express th...
We identify a rich class of finite-horizon Markov decision problems (MDPs) for which the variance of...
summary:This paper deals with a first passage mean-variance problem for semi-Markov decision process...
summary:This paper deals with a first passage mean-variance problem for semi-Markov decision process...
summary:This paper deals with a first passage mean-variance problem for semi-Markov decision process...
AbstractWe study controller synthesis problems for finite-state Markov decision processes, where the...
What are the functionals of the reward that can be computed and optimized exactly in Markov Decision...
We study controller synthesis problems for finite-state Markov decision processes, where the objecti...
We consider finite horizon Markov decision processes under performance measures that involve both th...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We study the complexity of central controller synthesis problems for finite-state Markov decision pr...
We identify a rich class of finite-horizon Markov decision problems (MDPs) for which the variance of...
In this note, we consider discrete-time Markov decision processes with finite state space. Recalling...
We develop the asymptotic variance for Markov decision processes. Results are provided to express th...
We identify a rich class of finite-horizon Markov decision problems (MDPs) for which the variance of...
summary:This paper deals with a first passage mean-variance problem for semi-Markov decision process...
summary:This paper deals with a first passage mean-variance problem for semi-Markov decision process...
summary:This paper deals with a first passage mean-variance problem for semi-Markov decision process...
AbstractWe study controller synthesis problems for finite-state Markov decision processes, where the...
What are the functionals of the reward that can be computed and optimized exactly in Markov Decision...
We study controller synthesis problems for finite-state Markov decision processes, where the objecti...