We present a policy search method that uses iteratively refitted local linear models to optimize trajectory distributions for large, continuous problems. These tra-jectory distributions can be used within the framework of guided policy search to learn policies with an arbitrary parameterization. Our method fits time-varying linear dynamics models to speed up learning, but does not rely on learning a global model, which can be difficult when the dynamics are complex and discontinuous. We show that this hybrid approach requires many fewer samples than model-free methods, and can handle complex, nonsmooth dynamics that can pose a challenge for model-based techniques. We present experiments showing that our method can be used to learn complex n...
In order to learn effective control policies for dynamical systems, policy search methods must be ab...
Most policy search (PS) algorithms require thousands of training episodes to find an effective polic...
Computational agents often need to learn policies that involve many control variables, e.g., a robot...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
Direct policy search methods offer the promise of automatically learning controllers for com-plex, h...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
The Iterative Linear Quadratic Regulator (ILQR), a variant of Differential Dynamic Programming (DDP)...
We present an Imitation Learning approach for the control of dynamical systems with a known model. ...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
We present an algorithm for policy search in stochastic dynamical systems using model-based reinforc...
© 2014 Elsevier B.V.In robotics, lower-level controllers are typically used to make the robot solve ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
The poster presents a new family of deep neural network-based dynamic systems and how these dynamics...
Policy search is a subfield in reinforcement learning which focuses on finding good parameters for ...
In order to learn effective control policies for dynamical systems, policy search methods must be ab...
Most policy search (PS) algorithms require thousands of training episodes to find an effective polic...
Computational agents often need to learn policies that involve many control variables, e.g., a robot...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
Direct policy search methods offer the promise of automatically learning controllers for com-plex, h...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
The Iterative Linear Quadratic Regulator (ILQR), a variant of Differential Dynamic Programming (DDP)...
We present an Imitation Learning approach for the control of dynamical systems with a known model. ...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
We present an algorithm for policy search in stochastic dynamical systems using model-based reinforc...
© 2014 Elsevier B.V.In robotics, lower-level controllers are typically used to make the robot solve ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
The poster presents a new family of deep neural network-based dynamic systems and how these dynamics...
Policy search is a subfield in reinforcement learning which focuses on finding good parameters for ...
In order to learn effective control policies for dynamical systems, policy search methods must be ab...
Most policy search (PS) algorithms require thousands of training episodes to find an effective polic...
Computational agents often need to learn policies that involve many control variables, e.g., a robot...