We consider the solution of stochastic dynamic programs using sample path estimates. Applying the theory of large deviations, we derive probability error bounds associated with the convergence of the estimated optimal policy to the true optimal policy, for finite horizon problems. These bounds decay at an exponential rate, in contrast with the usual canonical (inverse) square root rate associated with estimation of the value (cost-to-go) function itself. These results have practical implications for Monte Carlo simulation-based solution approaches to stochastic dynamic programming problems where it is impractical to extract the explicit transition probabilities of the underlying system model
In stochastic optimal control the distribution of the exogenous noise is typically unknown and must ...
Title: Stochastic Dynamic Programming Problems: Theory and Applications Author: Gabriel Lendel Depar...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
In this paper we discuss Monte Carlo simulation based approximations of a stochastic programming pro...
In this paper we discuss Monte Carlo simulation based approximations of a stochastic programming pro...
We consider a class of stochastic mathematical programs with complementarity constraints, in which b...
We study a model of stochastic evolutionary game dynamics in which the probabilities that agents cho...
This paper develops a new method for constructing approximate solutions to discrete time, infinite h...
textabstractWe consider a class of stochastic mathematical programs with complementarity constraints...
We consider a class of stochastic mathematical programs with complementarity constraints, in which b...
We present experimental results about learning function values (i.e. Bellman values) in stochastic d...
In this paper, we compare the performance of two scenario-based numerical methods to solve stochasti...
International audienceMany stochastic dynamic programming tasks in continuous action-spaces are tack...
This paper examines the properties of various approximation methods for solving stochastic dynamic p...
Caption title.Includes bibliographical references (leaf [7]).Supported by NSF. ECS 9216531 Supported...
In stochastic optimal control the distribution of the exogenous noise is typically unknown and must ...
Title: Stochastic Dynamic Programming Problems: Theory and Applications Author: Gabriel Lendel Depar...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
In this paper we discuss Monte Carlo simulation based approximations of a stochastic programming pro...
In this paper we discuss Monte Carlo simulation based approximations of a stochastic programming pro...
We consider a class of stochastic mathematical programs with complementarity constraints, in which b...
We study a model of stochastic evolutionary game dynamics in which the probabilities that agents cho...
This paper develops a new method for constructing approximate solutions to discrete time, infinite h...
textabstractWe consider a class of stochastic mathematical programs with complementarity constraints...
We consider a class of stochastic mathematical programs with complementarity constraints, in which b...
We present experimental results about learning function values (i.e. Bellman values) in stochastic d...
In this paper, we compare the performance of two scenario-based numerical methods to solve stochasti...
International audienceMany stochastic dynamic programming tasks in continuous action-spaces are tack...
This paper examines the properties of various approximation methods for solving stochastic dynamic p...
Caption title.Includes bibliographical references (leaf [7]).Supported by NSF. ECS 9216531 Supported...
In stochastic optimal control the distribution of the exogenous noise is typically unknown and must ...
Title: Stochastic Dynamic Programming Problems: Theory and Applications Author: Gabriel Lendel Depar...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...