In this paper we propose to integrate the recursive Levenberg-Marquardt (LM) method into the adaptive dynamic programming (ADP) design for improved learning and adaptive control performance. Our key motivation is a balanced weight-updating strategy that accounts for both robustness and convergence during online learning. Specifically, a modified recursive LM method is integrated into both the action network and the critic network of the ADP design, and a detailed learning algorithm is proposed to implement this approach. We test our approach on the triple-link inverted pendulum, a popular benchmark in the community, to demonstrate the online learning and control strategy. Exp...
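For readers unfamiliar with Levenberg-Marquardt weight updates, the following is a minimal sketch of the standard batch LM step, w ← w − (JᵀJ + μI)⁻¹Jᵀe, applied to a single tanh neuron. It is not the modified recursive variant the abstract refers to, whose details are not given here. The function names (`lm_step`, `fit_lm`), the toy data, and the rule of scaling the damping factor μ by 10 are all illustrative assumptions.

```python
import numpy as np

def lm_step(w, residual_fn, jacobian_fn, mu):
    """One damped Gauss-Newton (Levenberg-Marquardt) step on the weights w."""
    e = residual_fn(w)                    # residual vector, shape (n,)
    J = jacobian_fn(w)                    # Jacobian de/dw, shape (n, p)
    H = J.T @ J + mu * np.eye(w.size)     # damped approximate Hessian
    return w - np.linalg.solve(H, J.T @ e)

def fit_lm(w0, residual_fn, jacobian_fn, mu=1e-2, iters=200):
    """Repeat LM steps; shrink mu after a successful step, grow it otherwise."""
    w = np.asarray(w0, dtype=float)
    cost = 0.5 * np.sum(residual_fn(w) ** 2)
    for _ in range(iters):
        w_new = lm_step(w, residual_fn, jacobian_fn, mu)
        new_cost = 0.5 * np.sum(residual_fn(w_new) ** 2)
        if new_cost < cost:
            # Accept the step and move toward Gauss-Newton behaviour (faster).
            w, cost = w_new, new_cost
            mu = max(mu / 10.0, 1e-12)
        else:
            # Reject the step and move toward gradient descent (more robust).
            mu = min(mu * 10.0, 1e12)
    return w, cost

# Hypothetical toy problem: recover a and b in y = tanh(a*x + b).
x = np.linspace(-2.0, 2.0, 41)
y = np.tanh(1.5 * x - 0.5)

def residuals(w):
    return np.tanh(w[0] * x + w[1]) - y

def jacobian(w):
    s = 1.0 - np.tanh(w[0] * x + w[1]) ** 2   # derivative of tanh
    return np.column_stack((s * x, s))

w_fit, final_cost = fit_lm([0.1, 0.0], residuals, jacobian)
print(w_fit, final_cost)
```

The damping factor μ is what trades robustness against convergence speed: a large μ gives small, gradient-descent-like steps, while a small μ approaches the Gauss-Newton step. An online ADP design would apply such an update to the critic and action network weights at each time step rather than to a fixed batch.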
Adaptive dynamic programming (ADP) and reinforcement learning are quite relevant to each other when ...
Keywords: Adaptive dynamic programming; Neural network; Markov jump systems; Optimal control ...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includi...
In this paper, we propose a novel adaptive dynamic programming (ADP) architecture with three network...
Adaptive dynamic programming (ADP) is a promising research field for design of intelligent controlle...
This paper focuses on the efficiency improvement of online actor-critic design based on the Levenberg...
In this paper, we present a new adaptive dynamic programming approach by integrating a reference net...
Abstract — In this paper, we present a new adaptive dynamic programming approach by integrating a re...
Some three decades ago, certain computational intelligence methods of reinforcement learning were re...
In problems with complex dynamics and challenging state spaces, the dual heuristic programming (DHP)...
Humans have the ability to make use of experience while selecting their control actions for distinct...
An intelligent controller has the ability to analyse an unknown situation and to respond to it accor...