Computation of a satisfactory control policy for a Markov decision process when the parameters of the model are not exactly known is a problem encountered in many practical applications. The traditional robust approach is based on a worst-case analysis and may lead to an overly conservative policy. In this paper we con-sider the tradeoff between nominal performance and the worst case performance over all possible models. Based on parametric linear programming, we propose a method that computes the whole set of Pareto efficient policies in the performance-robustness plane when only the reward parameters are subject to uncertainty. In the more general case when the transition probabilities are also subject to error, we show that the strategy ...
The authors consider the fundamental problem of nding good poli-cies in uncertain models. It is demo...
Markov decision processes (MDPs) are a common approach to model dynamic optimization problems in man...
Markov Decision Processes (MDPs) constitute a mathematical framework for modelling systems featuring...
In robust Markov decision processes (MDPs), the uncertainty in the transition kernel is addressed by...
Optimal solutions to Markov decision problems may be very sensitive with respect to the state transi...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
Markov decision processes (MDPs) are powerful tools for decision making in uncertain dynamic environ...
In this paper, we seek robust policies for uncertain Markov Decision Processes (MDPs). Most robust o...
The authors consider the fundamental problem of finding good policies in uncertain models. It is dem...
Abstract. Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with unce...
Abstract — We consider decision making in a Markovian setup where the reward parameters are not know...
We study the synthesis of robust optimal control policies for Markov decision processes with transit...
Decision making formulated as finding a strategy that maximizes a utility function depends critic...
The authors consider the fundamental problem of nding good poli-cies in uncertain models. It is demo...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Sloan School of Management, Operations Resea...
The authors consider the fundamental problem of nding good poli-cies in uncertain models. It is demo...
Markov decision processes (MDPs) are a common approach to model dynamic optimization problems in man...
Markov Decision Processes (MDPs) constitute a mathematical framework for modelling systems featuring...
In robust Markov decision processes (MDPs), the uncertainty in the transition kernel is addressed by...
Optimal solutions to Markov decision problems may be very sensitive with respect to the state transi...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
Markov decision processes (MDPs) are powerful tools for decision making in uncertain dynamic environ...
In this paper, we seek robust policies for uncertain Markov Decision Processes (MDPs). Most robust o...
The authors consider the fundamental problem of finding good policies in uncertain models. It is dem...
Abstract. Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with unce...
Abstract — We consider decision making in a Markovian setup where the reward parameters are not know...
We study the synthesis of robust optimal control policies for Markov decision processes with transit...
Decision making formulated as finding a strategy that maximizes a utility function depends critic...
The authors consider the fundamental problem of nding good poli-cies in uncertain models. It is demo...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Sloan School of Management, Operations Resea...
The authors consider the fundamental problem of nding good poli-cies in uncertain models. It is demo...
Markov decision processes (MDPs) are a common approach to model dynamic optimization problems in man...
Markov Decision Processes (MDPs) constitute a mathematical framework for modelling systems featuring...