We propose the use of Model Predictive Control (MPC) for controlling systems described by Markov decision processes. First, we consider a straightforward MPC algorithm for Markov decision processes. Then, we propose value functions, a means to deal with issues arising in conventional MPC, e.g., computational requirements and sub-optimality of actions. We use reinforcement learning to let an MPC agent learn a value function incrementally. The agent incorporates experience from the interaction with the system in its decision making. Our approach initially relies on pure MPC. Over time, as experience increases, the learned value function is taken more and more into account. This speeds up the decision making, allows decisions to be made over a...
Learning in Partially Observable Markov Decision process (POMDP) is motivated by the essential need ...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This work addresses the problem of estimating the optimal value function in a Markov Decision Proces...
Model predictive control (MPC) is becoming an increasingly popular method to select actions for cont...
peer reviewedModel predictive control (MPC) and reinforcement learning (RL) are two popular families...
Reinforcement Learning (RL) has demonstrated a huge potential in learning optimal policies without a...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
We consider the inverse reinforcement learning problem, that is, the problem of learning from, and t...
Despite the success of reinforcement learning (RL) in various research fields, relatively few algori...
A model predictive controller based on recursive learning is proposed. In this SISO adaptive control...
In control systems theory, the Markov decision process (MDP) is a widely used optimization model inv...
Model predictive control (MPC) offers an optimal control technique to establish and ensure that the ...
We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDP...
Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential decision probl...
In model predictive control (MPC), also called recedinghorizon control, the control input is obtaine...
Learning in Partially Observable Markov Decision process (POMDP) is motivated by the essential need ...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This work addresses the problem of estimating the optimal value function in a Markov Decision Proces...
Model predictive control (MPC) is becoming an increasingly popular method to select actions for cont...
peer reviewedModel predictive control (MPC) and reinforcement learning (RL) are two popular families...
Reinforcement Learning (RL) has demonstrated a huge potential in learning optimal policies without a...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
We consider the inverse reinforcement learning problem, that is, the problem of learning from, and t...
Despite the success of reinforcement learning (RL) in various research fields, relatively few algori...
A model predictive controller based on recursive learning is proposed. In this SISO adaptive control...
In control systems theory, the Markov decision process (MDP) is a widely used optimization model inv...
Model predictive control (MPC) offers an optimal control technique to establish and ensure that the ...
We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDP...
Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential decision probl...
In model predictive control (MPC), also called recedinghorizon control, the control input is obtaine...
Learning in Partially Observable Markov Decision process (POMDP) is motivated by the essential need ...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This work addresses the problem of estimating the optimal value function in a Markov Decision Proces...