Periodicity is a simple form of nonstationarity in Markov decision processes. In this paper successive approximations are considered for discounted and undiscounted periodic Markov decision processes. For this type of process iteration steps can be performed without any loss of efficiency, provided the type of procedure is sensibly chosen. Moreover, and most important, it is possible to derive sharp bounds for the value function. Numerical evidence is provided
A method of successive approximations for discountedMarkovian decision problems is described byMacQu...
AbstractMost quantities of interest in discounted and undiscounted (semi-) Markov decision processes...
In this paper we consider a completely ergodic Markov decision process with finite state and decisio...
Periodicity is a simple form of nonstationarity in Markov decision processes. In this paper successi...
In this paper we will consider several variants of the standard successive approximation technique f...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
Successive Approximation (S.A.) methods, for solving discounted Markov decision problems, have been ...
Markov chains with periodic graphs arise frequently in a wide range of modelling experiments. Applic...
Markov decision processes which allow for an unbounded reward structure are considered. Conditions a...
For the numerical analysis of Markov decision processes quite a lot of algorithms have been presente...
A method of successive approximations for discountedMarkovian decision problems is described byMacQu...
AbstractMost quantities of interest in discounted and undiscounted (semi-) Markov decision processes...
In this paper we consider a completely ergodic Markov decision process with finite state and decisio...
Periodicity is a simple form of nonstationarity in Markov decision processes. In this paper successi...
In this paper we will consider several variants of the standard successive approximation technique f...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
Successive Approximation (S.A.) methods, for solving discounted Markov decision problems, have been ...
Markov chains with periodic graphs arise frequently in a wide range of modelling experiments. Applic...
Markov decision processes which allow for an unbounded reward structure are considered. Conditions a...
For the numerical analysis of Markov decision processes quite a lot of algorithms have been presente...
A method of successive approximations for discountedMarkovian decision problems is described byMacQu...
AbstractMost quantities of interest in discounted and undiscounted (semi-) Markov decision processes...
In this paper we consider a completely ergodic Markov decision process with finite state and decisio...