In this article we study the connection of stochastic optimal control and reinforcement learning. Our main motivation is an importance sampling application to rare events sampling which can be reformulated as an optimal control problem. By using a parameterized approach the optimal control problem turns into a stochastic optimization problem which still presents some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. With the aim to explore new methods we connect the optimal control problem to reinforcement learning since both share the same underlying framework namely a Markov decision process (MDP). We show how the MDP can be formulated for th...
In this handout we analyse reinforcement learning algorithms for Markov decision processes. The read...
International audienceIn reinforcement learning, an agent collects information interacting with an e...
This paper investigates to what extent one can improve reinforcement learning algorithms. Our study ...
We introduce a sensitivity-based view to the area of learning and optimization of stochastic dynamic...
The topic of this thesis is stochastic optimal control and reinforcement learning. Our aim is to uni...
Learning and optimization of stochastic systems is a multi-disciplinary area that attracts wide atte...
In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stoc...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
While stochastic optimal control, together with associate formulations like Reinforcement Learning,...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
In this paper, we study the optimal stopping problem in the so-called exploratory framework, in whic...
This thesis dives into the theory of discrete time stochastic optimal control through exploring dyna...
2014-10-14This dissertation addresses some problems in the area of learning, optimization and decisi...
The exploration/exploitation dilemma is a fundamental but often computationally intractable problem ...
In this handout we analyse reinforcement learning algorithms for Markov decision processes. The read...
International audienceIn reinforcement learning, an agent collects information interacting with an e...
This paper investigates to what extent one can improve reinforcement learning algorithms. Our study ...
We introduce a sensitivity-based view to the area of learning and optimization of stochastic dynamic...
The topic of this thesis is stochastic optimal control and reinforcement learning. Our aim is to uni...
Learning and optimization of stochastic systems is a multi-disciplinary area that attracts wide atte...
In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stoc...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
While stochastic optimal control, together with associate formulations like Reinforcement Learning,...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
In this paper, we study the optimal stopping problem in the so-called exploratory framework, in whic...
This thesis dives into the theory of discrete time stochastic optimal control through exploring dyna...
2014-10-14This dissertation addresses some problems in the area of learning, optimization and decisi...
The exploration/exploitation dilemma is a fundamental but often computationally intractable problem ...
In this handout we analyse reinforcement learning algorithms for Markov decision processes. The read...
International audienceIn reinforcement learning, an agent collects information interacting with an e...
This paper investigates to what extent one can improve reinforcement learning algorithms. Our study ...