The paper investigates stochastic resource allocation problems with scarce, reusable resources and non-preemtive, time-dependent, interconnected tasks. This approach is a natural generalization of several standard resource management problems, such as schedul-ing and transportation problems. First, reactive solutions are considered and defined as control policies of suitably reformulated Markov decision processes (MDPs). We argue that this reformulation has several favorable properties, such as it has finite state and action spaces, it is aperiodic, hence all policies are proper and the space of control policies can be safely restricted. Next, approximate dynamic programming (ADP) methods, such as fitted Q-learning, are suggested for comput...
A stochastic resource allocation model, based on the principles of Markov decision processes (MDPs)...
In the last few years, Reinforcement Learning (RL), also called adaptive (or approximate) dynamic pr...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
The paper investigates stochastic resource allocation problems with scarce, reusable resources and n...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. T...
We consider closed-loop solutions to stochastic opti-mization problems of resource allocation type. ...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. ...
Abstract. We are interested in contributing to solving effectively a particular type of real-time st...
The paper proposes Markov Decision Processes (MDPs) to model production control systems that work in...
Abstract. We describe the structure and the implementation aspects of the dynamic programming proced...
We present a technique for computing approximately optimal solutions to stochastic resource allocati...
Resource allocation is a ubiquitous problem that arises whenever scarce resources have to be distrib...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
In this paper, resource allocation problems are formulated via a set of parallel birth–death process...
We describe the structure and the implementation aspects of the dynamic programming procedure that w...
A stochastic resource allocation model, based on the principles of Markov decision processes (MDPs)...
In the last few years, Reinforcement Learning (RL), also called adaptive (or approximate) dynamic pr...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
The paper investigates stochastic resource allocation problems with scarce, reusable resources and n...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. T...
We consider closed-loop solutions to stochastic opti-mization problems of resource allocation type. ...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. ...
Abstract. We are interested in contributing to solving effectively a particular type of real-time st...
The paper proposes Markov Decision Processes (MDPs) to model production control systems that work in...
Abstract. We describe the structure and the implementation aspects of the dynamic programming proced...
We present a technique for computing approximately optimal solutions to stochastic resource allocati...
Resource allocation is a ubiquitous problem that arises whenever scarce resources have to be distrib...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
In this paper, resource allocation problems are formulated via a set of parallel birth–death process...
We describe the structure and the implementation aspects of the dynamic programming procedure that w...
A stochastic resource allocation model, based on the principles of Markov decision processes (MDPs)...
In the last few years, Reinforcement Learning (RL), also called adaptive (or approximate) dynamic pr...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...