We study the properties of the rolling horizon and the approximate rolling horizon procedures for the case of two-person zero-sum discounted semi-Markov games with infinite horizon, when the state space is a borelian set and the action spaces are considered compact. Under suitable conditions, we prove that the equilibrium is the unique solution of a dynamic programming equation, and we prove bounds which imply the convergence of the procedures when the horizon length tends to infinity. The approach is based on the formalism for Semi-Markov games developed by Luque-Vásquez in [11], together with extensions of the results of Hernández-Lerma and Lasserre [4] for Markov Decision Processes and Chang and Marcus [2] for Markov Games, both in discr...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
This paper has a two-fold purpose. First, we attempt to outline the development of the turnpike theo...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
International audienceWe study the behaviour of the rolling horizon procedure for the case of two-pe...
We study the behavior of the rolling horizon procedure for semi-Markov decision processes, with infi...
We study the properties of the rolling horizon and the approximate rolling horizon procedures for th...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
Canonical models of Markov decision processes (MDPs) usually consider geometric discounting based on...
In this paper, we consider two-person zero-sum discounted Markov games with finite state and action ...
We study partially observable semi-Markov game with discounted payoff on a Borel state space. We stu...
We study partially observable semi-Markov game with discounted payoff on a Borel state space. We stu...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
summary:This paper studies a class of discrete-time discounted semi-Markov control model on Borel sp...
We consider the problem of approximating the values and the equilibria in two-person zero-sum discou...
AbstractThis paper studies the convergence of value-iteration functions and the existence of error b...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
This paper has a two-fold purpose. First, we attempt to outline the development of the turnpike theo...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
International audienceWe study the behaviour of the rolling horizon procedure for the case of two-pe...
We study the behavior of the rolling horizon procedure for semi-Markov decision processes, with infi...
We study the properties of the rolling horizon and the approximate rolling horizon procedures for th...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
Canonical models of Markov decision processes (MDPs) usually consider geometric discounting based on...
In this paper, we consider two-person zero-sum discounted Markov games with finite state and action ...
We study partially observable semi-Markov game with discounted payoff on a Borel state space. We stu...
We study partially observable semi-Markov game with discounted payoff on a Borel state space. We stu...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
summary:This paper studies a class of discrete-time discounted semi-Markov control model on Borel sp...
We consider the problem of approximating the values and the equilibria in two-person zero-sum discou...
AbstractThis paper studies the convergence of value-iteration functions and the existence of error b...
This paper presents a number of successive approximation algorithms for the repeated two-person zero...
This paper has a two-fold purpose. First, we attempt to outline the development of the turnpike theo...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...