We present an empirical, gradient-based method for solving data-driven stochastic optimal control problems using the theory of kernel embeddings of distributions. By embedding the integral operator of a stochastic kernel in a reproducing kernel Hilbert space, we can compute an empirical approximation of stochastic optimal control problems, which can then be solved efficiently using the properties of the RKHS. Existing approaches typically rely upon finite control spaces or optimize over policies with finite support to enable optimization. In contrast, our approach uses kernel-based gradients computed using observed data to approximate the cost surface of the optimal control problem, which can then be optimized using gradient descent. We app...
We consider the problem of efficiently estimating gradients from stochastic simulation. Although the...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
The goal of this thesis is to develop a mathematical framework for optimal, accurate, and affordable...
We present an embedding of stochastic optimal control problems, of the so called path integral form,...
Stochastic Optimal Control is an elegant and general framework for specifying and solving control pr...
We present an in-depth survey of policy gradient methods as they are used in the machine learning co...
While stochastic optimal control, together with associate formulations like Reinforcement Learning,...
AbstractWe propose a stochastic gradient descent algorithm for learning the gradient of a regression...
The study of optimal control problems under uncertainty plays an important role in scientific numeri...
Much recent work in reinforcement learning and stochastic optimal control has focused on algorithms ...
The goal of this thesis is to develop a mathematical framework for optimal, accurate, and affordable...
The goal of this thesis is to develop a mathematical framework for autonomous behavior. We begin by ...
We focus on solving closed-loop stochastic problems, and propose a perturbed gradient algorithm to a...
International audienceIt is often said that control and estimation problems are in duality. Recently...
The goal of this thesis is to develop a mathematical framework for autonomous behavior. We begin by ...
We consider the problem of efficiently estimating gradients from stochastic simulation. Although the...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
The goal of this thesis is to develop a mathematical framework for optimal, accurate, and affordable...
We present an embedding of stochastic optimal control problems, of the so called path integral form,...
Stochastic Optimal Control is an elegant and general framework for specifying and solving control pr...
We present an in-depth survey of policy gradient methods as they are used in the machine learning co...
While stochastic optimal control, together with associate formulations like Reinforcement Learning,...
AbstractWe propose a stochastic gradient descent algorithm for learning the gradient of a regression...
The study of optimal control problems under uncertainty plays an important role in scientific numeri...
Much recent work in reinforcement learning and stochastic optimal control has focused on algorithms ...
The goal of this thesis is to develop a mathematical framework for optimal, accurate, and affordable...
The goal of this thesis is to develop a mathematical framework for autonomous behavior. We begin by ...
We focus on solving closed-loop stochastic problems, and propose a perturbed gradient algorithm to a...
International audienceIt is often said that control and estimation problems are in duality. Recently...
The goal of this thesis is to develop a mathematical framework for autonomous behavior. We begin by ...
We consider the problem of efficiently estimating gradients from stochastic simulation. Although the...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
The goal of this thesis is to develop a mathematical framework for optimal, accurate, and affordable...