In general, it is difficult to determine an optimal closed-loop policy in nonlinear control problems with continuous-valued state and control domains. Hence, approximations are often inevitable. The standard method of discretizing states and controls suffers from the curse of dimensionality and strongly depends on the chosen temporal sampling rate. In this paper, we introduce Gaussian process dynamic programming (GPDP) and determine an approximate globally optimal closed-loop policy. In GPDP, value functions in the Bellman recursion of the dynamic programming algorithm are modeled using Gaussian processes. GPDP returns an optimal statefeedback for a finite set of states. Based on these outcomes, we learn a possibly discontinuous closed-loop...
We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learn...
This thesis studies approximate optimal control of nonlinear systems. Particular attention is given ...
The control of complex systems can be done decomposing the control task into a sequence of control m...
In general, it is difficult to determine an optimal closed-loop policy in nonlinear control problems...
In general, it is difficult to determine an optimal closed-loop policy in nonlinear control problems...
Reinforcement learning (RL) and optimal control of systems with contin- uous states and actions requ...
Reinforcement learning (RL) and optimal control of systems with contin- uous states and actions requ...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
This paper presents an approximate policy iteration algorithm for solving infinite-horizon, discount...
This paper presents an approximate policy iteration algorithm for solving infinite-horizon, discount...
In this paper, we introduce a novel model-based approach to solving the important subclass of partia...
Control of nonlinear systems on continuous domains is a challenging task for various reasons. For ro...
*authors contributed equally Abstract—In this paper we present a fully automated ap-proach to (appro...
We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learn...
This thesis studies approximate optimal control of nonlinear systems. Particular attention is given ...
The control of complex systems can be done decomposing the control task into a sequence of control m...
In general, it is difficult to determine an optimal closed-loop policy in nonlinear control problems...
In general, it is difficult to determine an optimal closed-loop policy in nonlinear control problems...
Reinforcement learning (RL) and optimal control of systems with contin- uous states and actions requ...
Reinforcement learning (RL) and optimal control of systems with contin- uous states and actions requ...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
This paper presents an approximate policy iteration algorithm for solving infinite-horizon, discount...
This paper presents an approximate policy iteration algorithm for solving infinite-horizon, discount...
In this paper, we introduce a novel model-based approach to solving the important subclass of partia...
Control of nonlinear systems on continuous domains is a challenging task for various reasons. For ro...
*authors contributed equally Abstract—In this paper we present a fully automated ap-proach to (appro...
We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learn...
This thesis studies approximate optimal control of nonlinear systems. Particular attention is given ...
The control of complex systems can be done decomposing the control task into a sequence of control m...