The standard approach to stochastic control is dynamic programming. In this paper, we introduce an alternative approach based on direct comparison of the performance of any two policies. This is achieved by modeling the state process as a continuous-time and continuous-state Markov process and applying the same ideas as for the discrete-time and discrete-state case. This approach is simple and intuitively clear; it applies to different problems with, finite and infinite horizons, discounted and long-run-average performance, continuous and jump diffusions, in the same way. Discounting is not needed when dealing with long-run average performance. The approach provides a unified framework for stochastic control and other optimization theory an...
Sequential decision-making via dynamic programming. Unified approach to optimal control of stochasti...
We study stochasticmotion planning problems which involve a controlled process, with possibly discon...
The paper deals with continuous time Markov decision processes on a fairly general state space. The ...
Recently, a direct-comparison approach has been developed to control and optimize the performance of...
This thesis is devoted to the extension of the recently developed direct comparison approach from th...
Stochastic Control Theory is concerned with the control of dynamical systems which are random in som...
We introduce a sensitivity-based view to the area of learning and optimization of stochastic dynamic...
Another approach to finite differences is the well developed Markov Chain Approximation (MCA) of Kus...
In this thesis we investigate single and multi-player stochastic dynamic optimization prob-lems. We ...
The intent of this book is to present recent results in the control theory for the long run average ...
Testing theorems are received for controlled diffusion processes with progressive and pulse equation...
A two-timescale simulation-based actor-critic algorithm for solution of infinite horizon Markov deci...
In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution...
In this paper, we survey several recent developments on non- standard optimality criteria for contro...
Abstract. The stochastic versions of classical discrete optimal control problems are formulated and ...
Sequential decision-making via dynamic programming. Unified approach to optimal control of stochasti...
We study stochasticmotion planning problems which involve a controlled process, with possibly discon...
The paper deals with continuous time Markov decision processes on a fairly general state space. The ...
Recently, a direct-comparison approach has been developed to control and optimize the performance of...
This thesis is devoted to the extension of the recently developed direct comparison approach from th...
Stochastic Control Theory is concerned with the control of dynamical systems which are random in som...
We introduce a sensitivity-based view to the area of learning and optimization of stochastic dynamic...
Another approach to finite differences is the well developed Markov Chain Approximation (MCA) of Kus...
In this thesis we investigate single and multi-player stochastic dynamic optimization prob-lems. We ...
The intent of this book is to present recent results in the control theory for the long run average ...
Testing theorems are received for controlled diffusion processes with progressive and pulse equation...
A two-timescale simulation-based actor-critic algorithm for solution of infinite horizon Markov deci...
In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution...
In this paper, we survey several recent developments on non- standard optimality criteria for contro...
Abstract. The stochastic versions of classical discrete optimal control problems are formulated and ...
Sequential decision-making via dynamic programming. Unified approach to optimal control of stochasti...
We study stochasticmotion planning problems which involve a controlled process, with possibly discon...
The paper deals with continuous time Markov decision processes on a fairly general state space. The ...