We consider closed-loop solutions to stochastic opti-mization problems of resource allocation type. They concern with the dynamic allocation of reusable re-sources over time to non-preemtive interconnected tasks with stochastic durations. The aim is to minimize the expected value of a regular performance measure. First, we formulate the problem as a stochastic shortest path problem and argue that our formulation has favorable properties, e.g., it has finite horizon, it is acyclic, thus, all policies are proper, and moreover, the space of con-trol policies can be safely restricted. Then, we propose an iterative solution. Essentially, we apply a reinforce-ment learning based adaptive sampler to compute a sub-optimal control policy. We suggest...
For systems with limited capacity for storage, processing and transmission of data, the choice of sa...
Abstract. We describe the structure and the implementation aspects of the dynamic programming proced...
Copyright © 2014 IEEEPresented at IEEE Symposium on Adaptive Dynamic Programming and Reinforcement L...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. T...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. ...
The paper investigates stochastic resource allocation problems with scarce, reusable resources and n...
The paper investigates stochastic resource allocation problems with scarce, reusable resources and n...
We propose a new method for learning policies for large, partially observable Markov decision proces...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
This dissertation investigates the use of sampling methods for solving stochastic optimization probl...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
We consider a general class of dynamic resource allocation problems within a stochastic optimal cont...
Abstract We present modeling and solution strategies for large-scale resource allocation problems th...
International audienceWe consider the classical problem of sequential resource allocation where a de...
For systems with limited capacity for storage, processing and transmission of data, the choice of sa...
Abstract. We describe the structure and the implementation aspects of the dynamic programming proced...
Copyright © 2014 IEEEPresented at IEEE Symposium on Adaptive Dynamic Programming and Reinforcement L...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. T...
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. ...
The paper investigates stochastic resource allocation problems with scarce, reusable resources and n...
The paper investigates stochastic resource allocation problems with scarce, reusable resources and n...
We propose a new method for learning policies for large, partially observable Markov decision proces...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
This dissertation investigates the use of sampling methods for solving stochastic optimization probl...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
We consider a general class of dynamic resource allocation problems within a stochastic optimal cont...
Abstract We present modeling and solution strategies for large-scale resource allocation problems th...
International audienceWe consider the classical problem of sequential resource allocation where a de...
For systems with limited capacity for storage, processing and transmission of data, the choice of sa...
Abstract. We describe the structure and the implementation aspects of the dynamic programming proced...
Copyright © 2014 IEEEPresented at IEEE Symposium on Adaptive Dynamic Programming and Reinforcement L...