We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal reward of such a Markov Decision Process, satisfying a Bellman equation, converges to the solution of a continuous Hamilton-Jacobi-Bellman (HJB) equation based on the mean field approximation of the Markov Decision Process. We give bounds on the difference of the rewards, and a constructive algorithm for deriving an approximating solution to the Markov Decision Process from a solution of the HJB equations. We illustrate the method on three examples pertaining respectively to investment strategies, population dynamics control and scheduling in queues are developed....
Abstract. State-based systems with discrete or continuous time are of-ten modelled with the help of ...
41 pagesWe develop an exhaustive study of Markov decision process (MDP) under mean field interaction...
We consider a mean field control problem with regime switching in the state dynamics. The correspond...
We study the convergence of Markov decision processes, composed of a large number of objects, to opt...
We study the convergence of Markov Decision Processes made of a large number of objects to optimizat...
Abstract—We study the convergence of Markov decision pro-cesses, composed of a large number of objec...
We consider a finite number of $N$ statistically equal individuals, each moving on a finite set of s...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
This paper investigates the limit behavior of Markov Decision Processes (MDPs) made of independent p...
Conclusion Motivation, description of the problem A Markov Decision Process We consider: System of N...
Session 03 : Markov decision processes and mean field modelsInternational audienceIn this talk, I wi...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
Staudigl M. A limit theorem for Markov decision processes. Center for Mathematical Economics Working...
The mean-field game theory is the study of strategic decision making in very large populations of we...
Abstract. State-based systems with discrete or continuous time are of-ten modelled with the help of ...
41 pagesWe develop an exhaustive study of Markov decision process (MDP) under mean field interaction...
We consider a mean field control problem with regime switching in the state dynamics. The correspond...
We study the convergence of Markov decision processes, composed of a large number of objects, to opt...
We study the convergence of Markov Decision Processes made of a large number of objects to optimizat...
Abstract—We study the convergence of Markov decision pro-cesses, composed of a large number of objec...
We consider a finite number of $N$ statistically equal individuals, each moving on a finite set of s...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
This paper investigates the limit behavior of Markov Decision Processes (MDPs) made of independent p...
Conclusion Motivation, description of the problem A Markov Decision Process We consider: System of N...
Session 03 : Markov decision processes and mean field modelsInternational audienceIn this talk, I wi...
International audienceThis paper investigates the limit behavior of Markov decision processes made o...
Staudigl M. A limit theorem for Markov decision processes. Center for Mathematical Economics Working...
The mean-field game theory is the study of strategic decision making in very large populations of we...
Abstract. State-based systems with discrete or continuous time are of-ten modelled with the help of ...
41 pagesWe develop an exhaustive study of Markov decision process (MDP) under mean field interaction...
We consider a mean field control problem with regime switching in the state dynamics. The correspond...