International audienceThis paper investigates the limit behavior of Markov decision processes made of independent objects evolving in a common environment, when the number of objects (N) goes to infinity. In the finite horizon case, we show that when the number of objects becomes large, the optimal cost of the system converges to the optimal cost of a discrete time system that is deterministic. Convergence also holds for optimal policies. We further provide bounds on the speed of convergence by proving second order results that resemble central limits theorems for the cost and the state of the Markov decision process, with explicit formulas for the limit. These bounds (of order 1/N−−√ ) are proven to be tight in a numerical example. One can...
International audienceWe consider a class of stochastic games with finite number of resource states,...
A Complex System can be defined as a natural, artificial, social, or economic entity whose model inv...
Abstract—We study the convergence of Markov decision pro-cesses, composed of a large number of objec...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
This paper investigates the limit behavior of Markov Decision Processes (MDPs) made of independent p...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
We study the convergence of Markov decision processes, composed of a large number of objects, to opt...
We study the convergence of Markov Decision Processes made of a large number of objects to optimizat...
Conclusion Motivation, description of the problem A Markov Decision Process We consider: System of N...
We consider a finite number of $N$ statistically equal individuals, each moving on a finite set of s...
Session 03 : Markov decision processes and mean field modelsInternational audienceIn this talk, I wi...
Staudigl M. A limit theorem for Markov decision processes. Center for Mathematical Economics Working...
We consider a generic mean-field scenario, in which a sequence of population models, described by di...
Abstract. We prove a central limit theorem for a class of additive processes that arise naturally in...
International audienceWe consider a class of stochastic games with finite number of resource states,...
A Complex System can be defined as a natural, artificial, social, or economic entity whose model inv...
Abstract—We study the convergence of Markov decision pro-cesses, composed of a large number of objec...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
This paper investigates the limit behavior of Markov Decision Processes (MDPs) made of independent p...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
We study the convergence of Markov decision processes, composed of a large number of objects, to opt...
We study the convergence of Markov Decision Processes made of a large number of objects to optimizat...
Conclusion Motivation, description of the problem A Markov Decision Process We consider: System of N...
We consider a finite number of $N$ statistically equal individuals, each moving on a finite set of s...
Session 03 : Markov decision processes and mean field modelsInternational audienceIn this talk, I wi...
Staudigl M. A limit theorem for Markov decision processes. Center for Mathematical Economics Working...
We consider a generic mean-field scenario, in which a sequence of population models, described by di...
Abstract. We prove a central limit theorem for a class of additive processes that arise naturally in...
International audienceWe consider a class of stochastic games with finite number of resource states,...
A Complex System can be defined as a natural, artificial, social, or economic entity whose model inv...
Abstract—We study the convergence of Markov decision pro-cesses, composed of a large number of objec...