A central goal of the robotics community is to develop general optimization algorithms for producing high-performance dynamic behaviors in robot systems. This goal is challenging because many robot control tasks are characterized by significant stochasticity, high-dimensionality, expensive evaluations, and unknown or unreliable system models. Despite these challenges, a range of algorithms exist for performing efficient optimization of parameterized control policies with respect to average cost criteria. However, other statistics of the cost may also be important. In particular, for many stochastic control problems, it can be advantageous to select policies based not only on their average cost, but also their variance (or risk). In this the...
abstract: The goal of reinforcement learning is to enable systems to autonomously solve tasks in the...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
A central goal of the robotics community is to develop general optimization algorithms for producing...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
With the increasing pace of automation, modern robotic systems need to act in stochastic, non-statio...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
Data-driven approaches hold the promise of creating the next wave of robots that can perform diverse...
Presented on April 15, 2015 at 12:00 p.m. at the Manufacturing Related Disciplines Complex (MRDC), G...
Thesis: S.B., Massachusetts Institute of Technology, Department of Mechanical Engineering, 2020Catal...
For controlling high-dimensional robots, most stochastic optimal control algorithms use approximatio...
We present a mean-variance policy iteration (MVPI) framework for risk-averse control in a discounted...
Deployment in hazardous environments requires robots to understand the risks associated with their a...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
As robotic agents become increasingly present in human environments, task completion rates during hu...
abstract: The goal of reinforcement learning is to enable systems to autonomously solve tasks in the...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
A central goal of the robotics community is to develop general optimization algorithms for producing...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
With the increasing pace of automation, modern robotic systems need to act in stochastic, non-statio...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
Data-driven approaches hold the promise of creating the next wave of robots that can perform diverse...
Presented on April 15, 2015 at 12:00 p.m. at the Manufacturing Related Disciplines Complex (MRDC), G...
Thesis: S.B., Massachusetts Institute of Technology, Department of Mechanical Engineering, 2020Catal...
For controlling high-dimensional robots, most stochastic optimal control algorithms use approximatio...
We present a mean-variance policy iteration (MVPI) framework for risk-averse control in a discounted...
Deployment in hazardous environments requires robots to understand the risks associated with their a...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
As robotic agents become increasingly present in human environments, task completion rates during hu...
abstract: The goal of reinforcement learning is to enable systems to autonomously solve tasks in the...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...