Sample efficiency is one of the key factors when applying policy search to real-world problems. In recent years, Bayesian Optimization (BO) has become prominent in the field of robotics due to its sample efficiency and little prior knowledge needed. However, one drawback of BO is its poor performance on high-dimensional search spaces as it focuses on global search. In the policy search setting, local optimization is typically sufficient as initial policies are often available, e.g., via meta-learning, kinesthetic demonstrations or sim-to-real approaches. In this paper, we propose to constrain the policy search space to a sublevel-set of the Bayesian surrogate model’s predictive uncertainty. This simple yet effective way of constraining the ...
To cope with varying conditions, motor primitives (MPs) must support generalization over task parame...
Often we have to handle high dimensional spaces if we want to learn motor skills for robots. In poli...
Deep neural networks have recently become astonishingly successful at many machine learning problems...
Bayesian Optimization (BO) is an effective method for optimizing expensive-to-evaluate black-box fun...
Most policy search (PS) algorithms require thousands of training episodes to find an effective polic...
International audienceMost policy search algorithms require thousands of training episodes to find a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Bayesian optimization is renowned for its sample efficiency but its application to higher dimension...
International audienceThe most data-efficient algorithms for reinforcement learning in robotics are ...
Bayesian optimization has recently emerged in the machine learning community as a very effective aut...
International audienceMost policy search (PS) algorithms require thousands of training episodes to f...
Humans excel at confronting problems with little to no prior information about, and with few interac...
Humans excel at confronting problems with little to no prior information about, and with few interac...
Humans excel at confronting problems with little to no prior information about, and with few interac...
To cope with varying conditions, motor primitives (MPs) must support generalization over task parame...
Often we have to handle high dimensional spaces if we want to learn motor skills for robots. In poli...
Deep neural networks have recently become astonishingly successful at many machine learning problems...
Bayesian Optimization (BO) is an effective method for optimizing expensive-to-evaluate black-box fun...
Most policy search (PS) algorithms require thousands of training episodes to find an effective polic...
International audienceMost policy search algorithms require thousands of training episodes to find a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Bayesian optimization is renowned for its sample efficiency but its application to higher dimension...
International audienceThe most data-efficient algorithms for reinforcement learning in robotics are ...
Bayesian optimization has recently emerged in the machine learning community as a very effective aut...
International audienceMost policy search (PS) algorithms require thousands of training episodes to f...
Humans excel at confronting problems with little to no prior information about, and with few interac...
Humans excel at confronting problems with little to no prior information about, and with few interac...
Humans excel at confronting problems with little to no prior information about, and with few interac...
To cope with varying conditions, motor primitives (MPs) must support generalization over task parame...
Often we have to handle high dimensional spaces if we want to learn motor skills for robots. In poli...
Deep neural networks have recently become astonishingly successful at many machine learning problems...