We formulate the problem of scheduling a single server in a multi-class queueing system as a Markov decision process under the discounted cost and the average cost criteria. We develop a new implementation of the modified policy iteration (MPI) dynamic programming algorithm to efficiently solve problems with large state spaces and small action spaces. This implementation has an enhanced policy evaluation (PE) step and an adaptive termination test. To numerically evaluate various solution approaches, we implemented value iteration and forms of modified policy iteration, and we further developed and implemented aggregation-disaggregation based (replacement process decomposition and group-scaling) algorithms appropriate to controlled queueing ...
Abstract — This paper describes a novel algorithm called CON-MODP for computing Pareto optimal polic...
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that anothe...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
We give a closed-form expression for the discounted weighted queue length and switching costs of a t...
Dynamic programming (DP) is one of the most important mathematical programming methods. However, a m...
The running time of the classical algorithms of the Markov Decision Process (MDP) typically grows li...
International audienceWe consider a class of Markov Decision Processes frequently employed to model ...
We introduce a new algorithm based on linear programming for optimization of average-cost Markov dec...
This paper studies a class of queueing control problems involving commonly used control mechanisms s...
In the previous works, we have shown that policy iteration algorithms in performance optimization fo...
In this talk we consider queueing systems which are subject to control (e.g. admission control, rout...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
Modified policy iteration (MPI) is a dynamic programming (DP) algorithm that contains the two celebr...
Recent research indicates that perturbation analysis (PA), Markov decision processes (MDP), and rein...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
Abstract — This paper describes a novel algorithm called CON-MODP for computing Pareto optimal polic...
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that anothe...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
We give a closed-form expression for the discounted weighted queue length and switching costs of a t...
Dynamic programming (DP) is one of the most important mathematical programming methods. However, a m...
The running time of the classical algorithms of the Markov Decision Process (MDP) typically grows li...
International audienceWe consider a class of Markov Decision Processes frequently employed to model ...
We introduce a new algorithm based on linear programming for optimization of average-cost Markov dec...
This paper studies a class of queueing control problems involving commonly used control mechanisms s...
In the previous works, we have shown that policy iteration algorithms in performance optimization fo...
In this talk we consider queueing systems which are subject to control (e.g. admission control, rout...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
Modified policy iteration (MPI) is a dynamic programming (DP) algorithm that contains the two celebr...
Recent research indicates that perturbation analysis (PA), Markov decision processes (MDP), and rein...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
Abstract — This paper describes a novel algorithm called CON-MODP for computing Pareto optimal polic...
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that anothe...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...