The aim of this paper is to give an overview of recent developments in the area of successive approximations for Markov decision processes and Markov games. We will emphasize two aspects, viz. the conditions under which successive approximations converge in some strong sense and variations of these methods which diminish the amount of computational work to be executed. With respect to the first aspect it will be shown how much unboundedness of the rewards may be allowed without violation of the convergence. With respect to the second aspect we will present four ideas, that can be applied in conjunction, which may diminish the amount of work to be done. These ideas are: I. the use of the actual convergence of the iterates for the constructio...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
Markov decision processes which allow for an unbounded reward structure are considered. Conditions a...
In this paper an overview will be presented of the applicability of successive approximation methods...
In this paper we will consider several variants of the standard successive approximation technique f...
In this paper we will consider several variants of the standard successive approximation technique f...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give a survey of recent developments in the area of successive approxima...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
The aim of this paper is to give an overview of recent developments in the area of successive approx...
Markov decision processes which allow for an unbounded reward structure are considered. Conditions a...
In this paper an overview will be presented of the applicability of successive approximation methods...
In this paper we will consider several variants of the standard successive approximation technique f...
In this paper we will consider several variants of the standard successive approximation technique f...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...
The first part of this survey paper is devoted to derive under rather weak conditions, which don't g...