This paper reviews the current state of the art on reinforcement learning (RL)-based feedback control solutions to optimal regulation and tracking of single and multiagent systems. Existing RL solutions to both optimal H2 and H∞ control problems, as well as graphical games, will be reviewed. RL methods learn the solution to optimal control and game problems online and using measured data along the system trajectories. We discuss Q-learning and the integral RL algorithm as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively. Moreover, we discuss a new direction of off-policy RL for both CT and DT systems. Finally, we review several applications
Optimal control and Reinforcement Learning deal both with sequential decision-making problems, altho...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Recent years have witnessed phenomenal accomplishments of reinforcement learning (RL) in many promin...
This chapter presents adaptive solutions to the optimal tracking problem of nonlinear discrete-time ...
The framework of dynamic programming (DP) and reinforcement learning (RL) can be used to express imp...
This paper proposes a model-free H∞ control design for linear discrete-time systems using reinforcem...
This article offers an optimal control tracking method using an event-triggered technique and the in...
This paper presents an online learning algorithm based on integral reinforcement learning (IRL) to d...
This paper investigates the application of associative reinforcement learning techniques to the opti...
This paper proposes an original data-driven intelligent control solution to the cooperative output r...
In this paper, reinforcement learning (RL) is employed to find a casual solution to the linear quadr...
Abstract—In this paper, two Q-learning (QL) methods are proposed and their convergence theories are ...
Traditional feedback control methods are often model-based and the mathematical system models need t...
In this article, we propose a new reinforcement learning (RL) method for a system having continuous ...
Conventional closed‐form solution to the optimal control problem using optimal control theory is onl...
Optimal control and Reinforcement Learning deal both with sequential decision-making problems, altho...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Recent years have witnessed phenomenal accomplishments of reinforcement learning (RL) in many promin...
This chapter presents adaptive solutions to the optimal tracking problem of nonlinear discrete-time ...
The framework of dynamic programming (DP) and reinforcement learning (RL) can be used to express imp...
This paper proposes a model-free H∞ control design for linear discrete-time systems using reinforcem...
This article offers an optimal control tracking method using an event-triggered technique and the in...
This paper presents an online learning algorithm based on integral reinforcement learning (IRL) to d...
This paper investigates the application of associative reinforcement learning techniques to the opti...
This paper proposes an original data-driven intelligent control solution to the cooperative output r...
In this paper, reinforcement learning (RL) is employed to find a casual solution to the linear quadr...
Abstract—In this paper, two Q-learning (QL) methods are proposed and their convergence theories are ...
Traditional feedback control methods are often model-based and the mathematical system models need t...
In this article, we propose a new reinforcement learning (RL) method for a system having continuous ...
Conventional closed‐form solution to the optimal control problem using optimal control theory is onl...
Optimal control and Reinforcement Learning deal both with sequential decision-making problems, altho...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Recent years have witnessed phenomenal accomplishments of reinforcement learning (RL) in many promin...