This work extends and compares some recent model+learning-based methodologies for optimal control with input saturation. We focus on two methodologies: a model-based actor-critic (MBAC) strategy, and a nonlinear policy iteration strategy. To evaluate the performance of the algorithms, these strategies are applied to the swinging up an inverted pendulum. Numerical simulations show that the neural network approximation in the MBAC strategy can be poor, and the algorithm may converge far from the optimum. In the MBAC approach neither stabilization nor monotonic convergence can be guaranteed, and it is observed that the best value function is not always corresponding to the last one. On the other side the nonlinear policy iteration approach gua...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The design of optimal control laws for nonlinear systems is tackled without knowledge of the underly...
Inverted pendulums have been classic setups in the control laboratories since the 1950s. They were o...
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal c...
Cover: Saturated policy for the pendulum swing-up problem as learned by the model learning actor-cri...
Applying reinforcement learning to control systems enables the use of machine learning to develop el...
Abstract — In this paper we introduce an online algorithm that uses integral reinforcement knowledge...
In this contribution, we discuss Reinforcement Learning as an alternative way to solve optimal contr...
The development of computational power is constantly on the rise and makes for new possibilities in ...
The development of computational power is constantly on the rise and makes for new possibilities in ...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
Model-free reinforcement learning and nonlinear model predictive control are two different approache...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The design of optimal control laws for nonlinear systems is tackled without knowledge of the underly...
Inverted pendulums have been classic setups in the control laboratories since the 1950s. They were o...
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal c...
Cover: Saturated policy for the pendulum swing-up problem as learned by the model learning actor-cri...
Applying reinforcement learning to control systems enables the use of machine learning to develop el...
Abstract — In this paper we introduce an online algorithm that uses integral reinforcement knowledge...
In this contribution, we discuss Reinforcement Learning as an alternative way to solve optimal contr...
The development of computational power is constantly on the rise and makes for new possibilities in ...
The development of computational power is constantly on the rise and makes for new possibilities in ...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
Model-free reinforcement learning and nonlinear model predictive control are two different approache...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The infinite-horizon optimal control problem for nonlinear systems is studied. In the context of mod...
The design of optimal control laws for nonlinear systems is tackled without knowledge of the underly...