In this study, we consider the infinite-horizon, discounted cost, optimal control of stochastic nonlinear systems with separable cost and constraints in the state and input variables. Using the linear-time Legendre transform, we propose a novel numerical scheme for implementation of the corresponding value iteration (VI) algorithm in the conjugate domain. Detailed analyses of the convergence, time complexity, and error of the proposed algorithm are provided. In particular, with a discretization of size X and U for the state and input spaces, respectively, the proposed approach reduces the time complexity of each iteration in the VI algorithm from O(XU) to O(X+U), by replacing the minimization operation in the primal domain with a simple add...
summary:This paper is related to Markov Decision Processes. The optimal control problem is to minimi...
Abstract. We study the existence of optimal strategies and value func-tion of non stationary Markov ...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
In this work, we deal with a discrete-time infinite horizon Markov decision process with locally com...
A two-timescale simulation-based actor-critic algorithm for solution of infinite horizon Markov deci...
This article proposes a three-timescale simulation based algorithm for solution of infinite horizon ...
In this paper, approximate dynamic programming (ADP) problems are modeled by discounted infinite-hor...
The running time of the classical algorithms of the Markov Decision Process (MDP) typically grows li...
This paper addresses the optimality of stochastic control strategies based on the infinite horizon a...
We consider approximate dynamic programming for the infinite-horizon stationary γ-discounted optimal...
In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution...
We analyze the per unit-time infinite horizon average cost Markov control model, subject to a total ...
We consider infinite horizon stochastic dynamic programs with discounted costs and study how to use ...
This paper addresses the computational issues involved in the solution to an infinite-horizon optima...
We consider finite-state Markov decision processes, and prove convergence and rate of convergence re...
summary:This paper is related to Markov Decision Processes. The optimal control problem is to minimi...
Abstract. We study the existence of optimal strategies and value func-tion of non stationary Markov ...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
In this work, we deal with a discrete-time infinite horizon Markov decision process with locally com...
A two-timescale simulation-based actor-critic algorithm for solution of infinite horizon Markov deci...
This article proposes a three-timescale simulation based algorithm for solution of infinite horizon ...
In this paper, approximate dynamic programming (ADP) problems are modeled by discounted infinite-hor...
The running time of the classical algorithms of the Markov Decision Process (MDP) typically grows li...
This paper addresses the optimality of stochastic control strategies based on the infinite horizon a...
We consider approximate dynamic programming for the infinite-horizon stationary γ-discounted optimal...
In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution...
We analyze the per unit-time infinite horizon average cost Markov control model, subject to a total ...
We consider infinite horizon stochastic dynamic programs with discounted costs and study how to use ...
This paper addresses the computational issues involved in the solution to an infinite-horizon optima...
We consider finite-state Markov decision processes, and prove convergence and rate of convergence re...
summary:This paper is related to Markov Decision Processes. The optimal control problem is to minimi...
Abstract. We study the existence of optimal strategies and value func-tion of non stationary Markov ...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...