We study the properties of the rolling horizon and approximate rolling horizon procedures for two-person zero-sum discounted semi-Markov games with infinite horizon, under several assumptions on the reward function, when the state space is a Borel set and the action spaces are compact. Under suitable conditions, we prove that the equilibrium is the unique solution of its dynamic programming equation, and we prove bounds that imply the convergence of the procedures as the horizon length tends to infinity. The approach is based on the formalism for semi-Markov games developed by Luque-Vásquez, together with extensions of the results of Hernández-Lerma and Lasserre for Markov Decision Processes and Chang and ...
The existence of a limit for the values of discounted zero-sum...
Building on the receding horizon approach by Hernández-Lerma and Lasserre in solving Markov decision ...
This thesis consists of two independent parts, the first of which brings together two problems ...
We consider the problem of approximating the values and the equilibria in two-person zero-sum discou...
We study the behaviour of the rolling horizon procedure for the case of two-pe...
This paper is concerned with the links between the Value Iteration algorithm and the Rolling Horizon...
We study the behavior of the rolling horizon procedure for semi-Markov decision processes, with infi...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
We consider a receding horizon approach as an approximate solution to two-person zero-sum Markov game...
This paper formulates the optimal decentralized control problem for a class of mathematical models in...
We consider problems that involve the sequential selection of decisions in order to minimize expecte...
Recent results of Ye and Hansen, Miltersen and Zwick s...
Solving a 2-player zero-sum partially observable stochastic game (zs-POSG) typicall...