AbstractWe consider an approximation scheme for solving Markov decision processes (MDPs) with countable state space, finite action space, and bounded rewards that uses an approximate solution of a fixed finite-horizon sub-MDP of a given infinite-horizon MDP to create a stationary policy, which we call “approximate receding horizon control.” We first analyze the performance of the approximate receding horizon control for infinite-horizon average reward under an ergodicity assumption, which also generalizes the result obtained by White (J. Oper. Res. Soc. 33 (1982) 253–259). We then study two examples of the approximate receding horizon control via lower bounds to the exact solution to the sub-MDP. The first control policy is based on a finit...
We study reinforcement learning in an infinite-horizon average-reward setting with linear function a...
The infinite horizon setting is widely adopted for problems of reinforcement learning (RL). These in...
We consider a finite state/action Markov Decision Process over the infinite time horizon, and with t...
Building on the receding horizon approach by Hernandez-Lerma andLasserre in solving Markov decision ...
AbstractWe consider an approximation scheme for solving Markov decision processes (MDPs) with counta...
We consider a receding horizon approach as an approximate solution totwo-person zero-sum Markov game...
We study the behavior of the rolling horizon procedure for semi-Markov decision processes, with infi...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
AbstractThis paper studies the minimizing risk problems in Markov decision processes with countable ...
International audienceWe study the behaviour of the rolling horizon procedure for the case of two-pe...
AbstractIn a decision process (gambling or dynamic programming problem) with finite state space and ...
International audienceWe study the problem of receding horizon control for stochastic discrete-time ...
In this dissertation, we show a number of new results relating to stability, optimal control, and va...
In this dissertation, we show a number of new results relating to stability, optimal control, and va...
We study reinforcement learning in an infinite-horizon average-reward setting with linear function a...
The infinite horizon setting is widely adopted for problems of reinforcement learning (RL). These in...
We consider a finite state/action Markov Decision Process over the infinite time horizon, and with t...
Building on the receding horizon approach by Hernandez-Lerma andLasserre in solving Markov decision ...
AbstractWe consider an approximation scheme for solving Markov decision processes (MDPs) with counta...
We consider a receding horizon approach as an approximate solution totwo-person zero-sum Markov game...
We study the behavior of the rolling horizon procedure for semi-Markov decision processes, with infi...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
We consider the problem of approximating the values and the optimal policies in risk-averse discount...
AbstractThis paper studies the minimizing risk problems in Markov decision processes with countable ...
International audienceWe study the behaviour of the rolling horizon procedure for the case of two-pe...
AbstractIn a decision process (gambling or dynamic programming problem) with finite state space and ...
International audienceWe study the problem of receding horizon control for stochastic discrete-time ...
In this dissertation, we show a number of new results relating to stability, optimal control, and va...
In this dissertation, we show a number of new results relating to stability, optimal control, and va...
We study reinforcement learning in an infinite-horizon average-reward setting with linear function a...
The infinite horizon setting is widely adopted for problems of reinforcement learning (RL). These in...
We consider a finite state/action Markov Decision Process over the infinite time horizon, and with t...