This thesis presents a novel hierarchical learning framework, Reinforcement Learning Optimal Control, for controlling nonlinear dynamical systems with continuous states and actions. The adapted approach mimics the neural computations that allow our brain to bridge across the divide between symbolic action-selection and low-level actuation control by operating at two levels of abstraction. First, current findings demonstrate that at the level of limb coordination human behaviour is explained by linear optimal feedback control theory, where cost functions match energy and timing constraints of tasks. Second, humans learn cognitive tasks involving learning symbolic level action selection, in terms of both model-free and model-based rein...
Abstract — In this paper we introduce an online algorithm that uses integral reinforcement knowledge...
Many neural control systems are at least roughly optimized, but how is optimal control learned in t...
A model-based reinforcement learning algorithm is developed in this paper for fixed-final-time optim...
In this contribution, we discuss Reinforcement Learning as an alternative way to solve optimal contr...
This work describes the theoretical development and practical application of transition point dynam...
This work describes the theoretical development and practical application of transition point dynam...
Safety critical control is often trained in a simulated environment to mitigate risk. Subsequent mig...
Safety critical control is often trained in a simulated environment to mitigate risk. Subsequent mig...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Autonomous systems extend upon human capabilities and can be equipped with superhuman attributes in ...
This paper proposes a neural network based reinforcement learning controller that is able to learn c...
This paper reviews an almost new method for the design of optimal decision making controllers named ...
The efficient control of complex dynamical systems has many applications in the natural and applied ...
A continuous-time, continuous-state version of the temporal differ-ence (TD) algorithm is derived in...
Abstract — In this paper we introduce an online algorithm that uses integral reinforcement knowledge...
Many neural control systems are at least roughly optimized, but how is optimal control learned in t...
A model-based reinforcement learning algorithm is developed in this paper for fixed-final-time optim...
In this contribution, we discuss Reinforcement Learning as an alternative way to solve optimal contr...
This work describes the theoretical development and practical application of transition point dynam...
This work describes the theoretical development and practical application of transition point dynam...
Safety critical control is often trained in a simulated environment to mitigate risk. Subsequent mig...
Safety critical control is often trained in a simulated environment to mitigate risk. Subsequent mig...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Autonomous systems extend upon human capabilities and can be equipped with superhuman attributes in ...
This paper proposes a neural network based reinforcement learning controller that is able to learn c...
This paper reviews an almost new method for the design of optimal decision making controllers named ...
The efficient control of complex dynamical systems has many applications in the natural and applied ...
A continuous-time, continuous-state version of the temporal differ-ence (TD) algorithm is derived in...
Abstract — In this paper we introduce an online algorithm that uses integral reinforcement knowledge...
Many neural control systems are at least roughly optimized, but how is optimal control learned in t...
A model-based reinforcement learning algorithm is developed in this paper for fixed-final-time optim...