A central goal of the robotics community is to develop general optimization algorithms for producing high-performance dynamic behaviors in robot systems. This goal is challenging because many robot control tasks are characterized by significant stochasticity, high-dimensionality, expensive evaluations, and unknown or unreliable system models. Despite these challenges, a range of algorithms exist for performing efficient optimization of parameterized control policies with respect to average cost criteria. However, other statistics of the cost may also be important. In particular, for many stochastic control problems, it can be advantageous to select policies based not only on their average cost, but also their variance (or risk). In this the...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
When controlling dynamic systems such as mobile robots in uncertain environments, there is a trade o...
Abstract. In this paper we improve learning performance of a risk-aware robot facing navigation task...
A central goal of the robotics community is to develop general optimization algorithms for producing...
With the increasing pace of automation, modern robotic systems need to act in stochastic, non-statio...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
Thesis: S.B., Massachusetts Institute of Technology, Department of Mechanical Engineering, 2020Catal...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
Keeping risk under control is a primary objective in many critical real-world domains, including fin...
We present a mean-variance policy iteration (MVPI) framework for risk-averse control in a discounted...
For controlling high-dimensional robots, most stochastic optimal control algorithms use approximatio...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
In this paper we improve learning performance of a risk-aware robot facing navigation tasks by emplo...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
When controlling dynamic systems such as mobile robots in uncertain environments, there is a trade o...
Abstract. In this paper we improve learning performance of a risk-aware robot facing navigation task...
A central goal of the robotics community is to develop general optimization algorithms for producing...
With the increasing pace of automation, modern robotic systems need to act in stochastic, non-statio...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
Thesis: S.B., Massachusetts Institute of Technology, Department of Mechanical Engineering, 2020Catal...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
Keeping risk under control is a primary objective in many critical real-world domains, including fin...
We present a mean-variance policy iteration (MVPI) framework for risk-averse control in a discounted...
For controlling high-dimensional robots, most stochastic optimal control algorithms use approximatio...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
In this paper we improve learning performance of a risk-aware robot facing navigation tasks by emplo...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partiall...
When controlling dynamic systems such as mobile robots in uncertain environments, there is a trade o...
Abstract. In this paper we improve learning performance of a risk-aware robot facing navigation task...