Singular perturbation techniques allow the derivation of an aggregate model whose solution is asymptotically optimal for Markov decision processes with strong and weak interactions. We develop an algorithm that takes advantage of the asymptotic optimality of the aggregate model in order to compute the solution of the original model. We derive conditions for which the proposed algorithm has better worst case complexity than conventional contraction algorithms. Based on our complexity analysis, we show that the major benefit of aggregation is that the reduced order model is no longer ill conditioned. The reduction in the number of states (due to aggregation) is a secondary benefit. This is a surprising result since intuition would suggest tha...
In this paper, we propose a single sample path based algorithm with state aggregation to optimize th...
Bibliography: p. 41-42.Supported by the Air Force Office of Scientific Research Grant AFOSR-82-0258....
Stochastic orders can be applied to Markov reward models and used to aggregate models, while introdu...
An iterative aggregation procedure is described for solving large scale, finite state, finite action...
190 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2005.Markovian modeling of systems...
190 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2005.Markovian modeling of systems...
In this paper we consider a singular perturbation of order 2 for a Markov decision process with the ...
A straightforward algorithm for the multiple time scale decomposition of singularly perturbed Markov...
The solution of Markov Decision Processes (MDPs) often relies on special properties of the processes...
A straightforward algorithm for the multiple time scale decomposition of singularly perturbed Markov...
This paper provides a systematic method of obtaining reduced-complexity approximations to aggregate ...
This paper provides a systematic method of obtaining reduced-complexity approximations to aggregate ...
In this thesis, the theory of lumpability (strong lumpability and weak lumpability) of irreducible f...
Abstract — In this paper, we investigate the problem of aggregating a given finite-state Markov proc...
AbstractLet P be the transition matrix of a nearly uncoupled Markov chain. The states can be grouped...
In this paper, we propose a single sample path based algorithm with state aggregation to optimize th...
Bibliography: p. 41-42.Supported by the Air Force Office of Scientific Research Grant AFOSR-82-0258....
Stochastic orders can be applied to Markov reward models and used to aggregate models, while introdu...
An iterative aggregation procedure is described for solving large scale, finite state, finite action...
190 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2005.Markovian modeling of systems...
190 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2005.Markovian modeling of systems...
In this paper we consider a singular perturbation of order 2 for a Markov decision process with the ...
A straightforward algorithm for the multiple time scale decomposition of singularly perturbed Markov...
The solution of Markov Decision Processes (MDPs) often relies on special properties of the processes...
A straightforward algorithm for the multiple time scale decomposition of singularly perturbed Markov...
This paper provides a systematic method of obtaining reduced-complexity approximations to aggregate ...
This paper provides a systematic method of obtaining reduced-complexity approximations to aggregate ...
In this thesis, the theory of lumpability (strong lumpability and weak lumpability) of irreducible f...
Abstract — In this paper, we investigate the problem of aggregating a given finite-state Markov proc...
AbstractLet P be the transition matrix of a nearly uncoupled Markov chain. The states can be grouped...
In this paper, we propose a single sample path based algorithm with state aggregation to optimize th...
Bibliography: p. 41-42.Supported by the Air Force Office of Scientific Research Grant AFOSR-82-0258....
Stochastic orders can be applied to Markov reward models and used to aggregate models, while introdu...