The solution of Markov Decision Processes (MDPs) often relies on special properties of the processes. For two-level MDPs, the difference in the rates of state changes of the upper and lower levels has led to limiting or approximate solutions of such problems. In this paper, we solve a two-level MDP without making any assumption on the rates of state changes of the two levels. We first show that such a two-level MDP is a non-standard one where the optimal actions of different states can be related to each other. Then we give assumptions (conditions) under which such a specially constrained MDP can be solved by policy iteration. We further show that the computational effort can be reduced by decomposing the MDP. A two-level MDP with M upper-l...
Colloque avec actes et comité de lecture.In this paper, we present two state aggregation methods, us...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
This paper introduces a two-phase approach to solve average cost Markov decision processes, which is...
This note addresses the time aggregation approach to ergodic finite state Markov decision processes ...
We propose a time aggregation approach for the solution of infinite horizon average cost Markov deci...
This paper applies two-phase time aggregation to solve discounted Markov decision processes (MDP). T...
Many problems in discrete event dynamic systems(DEDS) can be modeled as Markov Decision Processes. H...
An iterative aggregation procedure is described for solving large scale, finite state, finite action...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
This paper proposes a simple analytical model called M time-scale MarkovDecision Process (MMDP) for ...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
Time-average Markov decision problems are considered for the finite state and action spaces. Several...
Time-average Markov decision problems are considered for the finite state and action spaces. Several...
As classical methods are intractable for solving Markov decision processes (MDPs) requiring a large ...
Colloque avec actes et comité de lecture.In this paper, we present two state aggregation methods, us...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
This paper introduces a two-phase approach to solve average cost Markov decision processes, which is...
This note addresses the time aggregation approach to ergodic finite state Markov decision processes ...
We propose a time aggregation approach for the solution of infinite horizon average cost Markov deci...
This paper applies two-phase time aggregation to solve discounted Markov decision processes (MDP). T...
Many problems in discrete event dynamic systems(DEDS) can be modeled as Markov Decision Processes. H...
An iterative aggregation procedure is described for solving large scale, finite state, finite action...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
This paper proposes a simple analytical model called M time-scale MarkovDecision Process (MMDP) for ...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
Time-average Markov decision problems are considered for the finite state and action spaces. Several...
Time-average Markov decision problems are considered for the finite state and action spaces. Several...
As classical methods are intractable for solving Markov decision processes (MDPs) requiring a large ...
Colloque avec actes et comité de lecture.In this paper, we present two state aggregation methods, us...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...