Abstract — In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learning the continuous-time optimal control solution for nonlinear systems with infinite horizon costs and partial knowledge of the system dynamics. This algorithm is a data based approach to the solution of the Hamilton-Jacobi-Bellman equation and it does not require explicit knowledge on the system’s drift dynamics. The adaptive algorithm use the structure of policy iteration, and it is implemented on an actor/critic structure. Both actor and critic neural networks are adapted simultaneously and a persistence of excitation condition is required to guarantee convergence of the critic to the actual optimal value function. &ovel tuni...
This paper presents a reinforcement learning framework for continuous-time dynamical systems without...
This paper proposes a new approximate dynamic programming algorithm to solve the infinite-horizon op...
This paper presents an online learning algorithm based on integral reinforcement learning (IRL) to d...
In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learni...
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal c...
This paper is an effort towards developing an online learning algorithm to find the optimal control ...
In this paper, an integral reinforcement learning (IRL) algorithm on an actor-critic structure is de...
In this paper, a new formulation for the optimal tracking control problem (OTCP) of continuous-time ...
Abstract — In this paper, using a neural-network-based online learning optimal control approach, a n...
This chapter presents adaptive solutions to the optimal tracking problem of nonlinear discrete-time ...
Online adaptive optimal control methods based on reinforcement learning algorithms typically need to...
IEEE Catalog Number: CFP15SIP-USBA new policy-iteration algorithm based on neural networks (NNs) is ...
Abstract — This paper proposes a control algorithm based on adaptive dynamic programming to solve th...
Published online: 11 Aug 2019.This study proposes a modified value-function-approximation (MVFA) and ...
In this article, an actor-critic neural network (NN)-based online optimal adaptive regulation of a c...
This paper presents a reinforcement learning framework for continuous-time dynamical systems without...
This paper proposes a new approximate dynamic programming algorithm to solve the infinite-horizon op...
This paper presents an online learning algorithm based on integral reinforcement learning (IRL) to d...
In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learni...
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal c...
This paper is an effort towards developing an online learning algorithm to find the optimal control ...
In this paper, an integral reinforcement learning (IRL) algorithm on an actor-critic structure is de...
In this paper, a new formulation for the optimal tracking control problem (OTCP) of continuous-time ...
Abstract — In this paper, using a neural-network-based online learning optimal control approach, a n...
This chapter presents adaptive solutions to the optimal tracking problem of nonlinear discrete-time ...
Online adaptive optimal control methods based on reinforcement learning algorithms typically need to...
IEEE Catalog Number: CFP15SIP-USBA new policy-iteration algorithm based on neural networks (NNs) is ...
Abstract — This paper proposes a control algorithm based on adaptive dynamic programming to solve th...
Published online: 11 Aug 2019.This study proposes a modified value-function-approximation (MVFA) and ...
In this article, an actor-critic neural network (NN)-based online optimal adaptive regulation of a c...
This paper presents a reinforcement learning framework for continuous-time dynamical systems without...
This paper proposes a new approximate dynamic programming algorithm to solve the infinite-horizon op...
This paper presents an online learning algorithm based on integral reinforcement learning (IRL) to d...