Optimal control and Reinforcement Learning deal both with sequential decision-making problems, although they use different tools. In this thesis, we have investigated the connection between these two research areas. In particular, our contributions are twofold. In the first part of the thesis, we present and study an optimal control problem with uncertain dynamics. As a modeling assumption, we will suppose that the knowledge that an agent has on the current system is represented by a probability distribution π on the space of possible dynamics functions. The goal is to minimize an average cost functional, where the average is computed with respect to the probability distribution π. This framework describes well the behavior of a class of m...
The framework of dynamic programming (DP) and reinforcement learning (RL) can be used to express imp...
This survey is focused on certain sequential decision-making problems that involve optimizing over p...
This survey is focused on certain sequential decision-making problems that involve optimizing over p...
We consider an LQR optimal control problem with partially unknown dynamics. We propose a new model-b...
In this paper, we will deal with a linear quadratic optimal control problem with unknown dynamics. A...
In this paper, we combine optimal control theory and machine learning techniques to propose and solv...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
This paper reviews the current state of the art on reinforcement learning (RL)-based feedback contro...
Reinforcement learning (RL) and optimal control of systems with contin- uous states and actions requ...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
International audienceReinforcement learning (RL) is a machine learning answer to the optimal contro...
International audienceReinforcement learning (RL) is a machine learning answer to the optimal contro...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
International audienceReinforcement learning (RL) is a machine learning answer to the optimal contro...
The framework of dynamic programming (DP) and reinforcement learning (RL) can be used to express imp...
This survey is focused on certain sequential decision-making problems that involve optimizing over p...
This survey is focused on certain sequential decision-making problems that involve optimizing over p...
We consider an LQR optimal control problem with partially unknown dynamics. We propose a new model-b...
In this paper, we will deal with a linear quadratic optimal control problem with unknown dynamics. A...
In this paper, we combine optimal control theory and machine learning techniques to propose and solv...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
This paper reviews the current state of the art on reinforcement learning (RL)-based feedback contro...
Reinforcement learning (RL) and optimal control of systems with contin- uous states and actions requ...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
International audienceReinforcement learning (RL) is a machine learning answer to the optimal contro...
International audienceReinforcement learning (RL) is a machine learning answer to the optimal contro...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
International audienceReinforcement learning (RL) is a machine learning answer to the optimal contro...
The framework of dynamic programming (DP) and reinforcement learning (RL) can be used to express imp...
This survey is focused on certain sequential decision-making problems that involve optimizing over p...
This survey is focused on certain sequential decision-making problems that involve optimizing over p...