Many iterative algorithms employ operators which are difficult to evaluate exactly, but for which a graduated range of approximations exist. In such cases, coarse-to-fine algorithms are often used, in which a crude initial operator approximation is gradually refined with new iterations. In such algorithms, because the computational complexity increases over iterations, the algorithm's convergence rate is properly calculated with respect to cumulative computation time. This suggests the problem of determining an optimal rate of refinement for the operator approximation. This paper shows that, for linearly convergent algorithm, the optimal rate of refinement approaches the rate of convergence of the exact algorithm itself, regardless of the t...
We consider the numerical solution of discrete-time, stationary, infinite horizon, discounted stocha...
AbstractWe propose a new adaptive algorithm with decreasing step-size for stochastic approximations....
Most of the current theory for dynamic programming algorithms focuses on finite state, finite action...
Caption title.Bibliography: p. 13.Supported, in part, by the ARO under grant DAAL03-86-K-0171 Suppor...
In this paper we will consider several variants of the standard successive approximation technique f...
In this paper we study a class of modified policy iteration algorithms for solving Markov decision p...
summary:The present paper studies the approximate value iteration (AVI) algorithm for the average co...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
We propose a new adaptive algorithm with decreasing step-size for stochastic approximations. The use...
This research is an effort to improve the performance of successive approximation algorithm with a p...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
In this paper we study the numerical approximation of the optimal long-run average cost of a continu...
We propose a new adaptive algorithm with decreasing step-size for stochastic approximations. The use...
We consider the discrete-time infinite-horizon optimal control problem formalized by Markov de-cisio...
We consider the numerical solution of discrete-time, stationary, infinite horizon, discounted stocha...
AbstractWe propose a new adaptive algorithm with decreasing step-size for stochastic approximations....
Most of the current theory for dynamic programming algorithms focuses on finite state, finite action...
Caption title.Bibliography: p. 13.Supported, in part, by the ARO under grant DAAL03-86-K-0171 Suppor...
In this paper we will consider several variants of the standard successive approximation technique f...
In this paper we study a class of modified policy iteration algorithms for solving Markov decision p...
summary:The present paper studies the approximate value iteration (AVI) algorithm for the average co...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
We propose a new adaptive algorithm with decreasing step-size for stochastic approximations. The use...
This research is an effort to improve the performance of successive approximation algorithm with a p...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
In this paper we study the numerical approximation of the optimal long-run average cost of a continu...
We propose a new adaptive algorithm with decreasing step-size for stochastic approximations. The use...
We consider the discrete-time infinite-horizon optimal control problem formalized by Markov de-cisio...
We consider the numerical solution of discrete-time, stationary, infinite horizon, discounted stocha...
AbstractWe propose a new adaptive algorithm with decreasing step-size for stochastic approximations....
Most of the current theory for dynamic programming algorithms focuses on finite state, finite action...