This paper considers Markov decision processes (MDPs) with unbounded rates, as a function of state. We are especially interested in studying structural properties of optimal policies and the value function. A common method to derive such properties is by value iteration applied to the uniformised MDP. However, due to the unboundedness of the rates, uniformisation is not possible, and so value iteration cannot be applied in the way we need. To circumvent this, one can perturb the MDP. Then we need two results for the perturbed sequence of MDPs: 1. there exists a unique solution to the discounted cost optimality equation for each perturbation as well as for the original MDP; 2. if the perturbed sequence of MDPs converges in a suitable manner ...
This work considers denumerable state Markov Decision Chains endowed with a long-run expected averag...
This paper presents an axiomatic approach to finite Markov decision processes where the discount rat...
In this article, we study continuous-time Markov decision processes in Polish spaces. The optimality...
The derivation of structural properties of countable state Markov decision processes (MDPs) is gener...
The derivation of structural properties of countable state Markov decision processes (MDPs) is gener...
summary:In this paper there are considered Markov decision processes (MDPs) that have the discounted...
AbstractThe following optimality principle is established for finite undiscounted or discounted Mark...
This research is interested in optimal control of Markov decision processes ...
AbstractIn this paper, we study discounted Markov decision processes on an uncountable state space. ...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
AbstractThis paper studies the minimizing risk problems in Markov decision processes with countable ...
summary:In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerabl...
AbstractThis paper deals with the average expected reward criterion for continuous-time Markov decis...
We consider multistage decision processes where criterion function is an expectation of minimum func...
This paper presents sufficient conditions for the existence of stationary optimal policies for avera...
This work considers denumerable state Markov Decision Chains endowed with a long-run expected averag...
This paper presents an axiomatic approach to finite Markov decision processes where the discount rat...
In this article, we study continuous-time Markov decision processes in Polish spaces. The optimality...
The derivation of structural properties of countable state Markov decision processes (MDPs) is gener...
The derivation of structural properties of countable state Markov decision processes (MDPs) is gener...
summary:In this paper there are considered Markov decision processes (MDPs) that have the discounted...
AbstractThe following optimality principle is established for finite undiscounted or discounted Mark...
This research is interested in optimal control of Markov decision processes ...
AbstractIn this paper, we study discounted Markov decision processes on an uncountable state space. ...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
AbstractThis paper studies the minimizing risk problems in Markov decision processes with countable ...
summary:In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerabl...
AbstractThis paper deals with the average expected reward criterion for continuous-time Markov decis...
We consider multistage decision processes where criterion function is an expectation of minimum func...
This paper presents sufficient conditions for the existence of stationary optimal policies for avera...
This work considers denumerable state Markov Decision Chains endowed with a long-run expected averag...
This paper presents an axiomatic approach to finite Markov decision processes where the discount rat...
In this article, we study continuous-time Markov decision processes in Polish spaces. The optimality...