International audienceWe present a new algorithm called policy iteration plus (PI +) for the optimal control of nonlinear deterministic discrete-time plants with general cost functions. PI + builds upon classical policy iteration and has the distinctive feature to enforce recursive feasibility under mild conditions, in the sense that the minimization problems solved at each iteration are guaranteed to admit a solution. While recursive feasibility is a desired property, it appears that existing results on the policy iteration algorithm fail to ensure it in general, contrary to PI +. We also establish the recursive stability of PI + : the policies generated at each iteration ensure a stability property for the closed-loop system. We prove our...
Optimal control theory has a long history and broad applications. Motivated by the goal of obtaining...
Abstract. We present an accelerated algorithm for the solution of static Hamilton-Jacobi-Bellman equ...
Convergence of the policy iteration method for discrete and continuous optimal control problems hold...
International audienceWe present a new algorithm called policy iteration plus (PI +) for the optimal...
We consider the problem of learning discounted-cost optimal control policies for unknown determinist...
Value iteration is a method to generate optimal control inputs for generic nonlinear systems and cos...
Approximate policy iteration (API) is studied to solve undiscounted optimal control problems in this...
We consider the discrete-time infinite-horizon optimal control problem formalized by Markov de-cisio...
This thesis studies approximate optimal control of nonlinear systems. Particular attention is given ...
Abstract — This paper is concerned with a new discrete-time policy iteration adaptive dynamic progra...
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998.Include...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
Recent research indicates that perturbation analysis (PA), Markov decision processes (MDP), and rein...
We consider infinite horizon dynamic programming problems, where the control at each stage consists ...
Approximate policy iteration is a class of reinforcement learning (RL) algorithms where the policy i...
Optimal control theory has a long history and broad applications. Motivated by the goal of obtaining...
Abstract. We present an accelerated algorithm for the solution of static Hamilton-Jacobi-Bellman equ...
Convergence of the policy iteration method for discrete and continuous optimal control problems hold...
International audienceWe present a new algorithm called policy iteration plus (PI +) for the optimal...
We consider the problem of learning discounted-cost optimal control policies for unknown determinist...
Value iteration is a method to generate optimal control inputs for generic nonlinear systems and cos...
Approximate policy iteration (API) is studied to solve undiscounted optimal control problems in this...
We consider the discrete-time infinite-horizon optimal control problem formalized by Markov de-cisio...
This thesis studies approximate optimal control of nonlinear systems. Particular attention is given ...
Abstract — This paper is concerned with a new discrete-time policy iteration adaptive dynamic progra...
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998.Include...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
Recent research indicates that perturbation analysis (PA), Markov decision processes (MDP), and rein...
We consider infinite horizon dynamic programming problems, where the control at each stage consists ...
Approximate policy iteration is a class of reinforcement learning (RL) algorithms where the policy i...
Optimal control theory has a long history and broad applications. Motivated by the goal of obtaining...
Abstract. We present an accelerated algorithm for the solution of static Hamilton-Jacobi-Bellman equ...
Convergence of the policy iteration method for discrete and continuous optimal control problems hold...