Many stochastic search algorithms are designed to optimize a fixed objective function to learn a task, i.e., if the objective function changes slightly, for example, due to a change in the situation or context of the task, relearning is required to adapt to the new context. For instance, if we want to learn a kicking movement for a soccer robot, we have to relearn the movement for different ball locations. Such relearning is undesired as it is highly inefficient and many applications require a fast adaptation to a new context/situation. Therefore, we investigate contextual stochastic search algorithms that can learn multiple, similar tasks simultaneously. Current contextual stochastic search methods are based on policy search algorithms an...
International audienceWe derive a stochastic search procedure for parameter optimization from two fi...
International audienceContextual bandit algorithms are essential for solving many real-world interac...
In visual search, detection of a target is faster when a layout of nontarget items is repeatedly enc...
Many stochastic search algorithms are designed to optimize a fixed objective function to learn a tas...
Stochastic search algorithms have recently also gained a lot of attention in operations research, ma...
Stochastic search algorithms are black-box optimizer of an objective function. They have recently ga...
We consider the problem of learning skills that are versatilely applicable. One popular approach for...
Direct policy search has been successful in learning challenging real world robotic motor skills by ...
The MAP-i Doctoral Programme in Informatics, of the Universities of Minho, Aveiro and PortoOptimizat...
In robotics, lower-level controllers are typically used to make the robot solve a specific task in a...
Direct contextual policy search methods learn to improve policy parameters and simultaneously gener...
Trabajo presentado para la International Conference on Intelligent Robots and Systems (IRos), en Las...
In robotics, controllers make the robot solve a task within a specific context. The context can desc...
Contextual skill models enable robot to generalize parameterized skills for a range of task paramete...
CMA-ES is one of the most popular stochastic search algorithms. It performs favourably in many task...
International audienceWe derive a stochastic search procedure for parameter optimization from two fi...
International audienceContextual bandit algorithms are essential for solving many real-world interac...
In visual search, detection of a target is faster when a layout of nontarget items is repeatedly enc...
Many stochastic search algorithms are designed to optimize a fixed objective function to learn a tas...
Stochastic search algorithms have recently also gained a lot of attention in operations research, ma...
Stochastic search algorithms are black-box optimizer of an objective function. They have recently ga...
We consider the problem of learning skills that are versatilely applicable. One popular approach for...
Direct policy search has been successful in learning challenging real world robotic motor skills by ...
The MAP-i Doctoral Programme in Informatics, of the Universities of Minho, Aveiro and PortoOptimizat...
In robotics, lower-level controllers are typically used to make the robot solve a specific task in a...
Direct contextual policy search methods learn to improve policy parameters and simultaneously gener...
Trabajo presentado para la International Conference on Intelligent Robots and Systems (IRos), en Las...
In robotics, controllers make the robot solve a task within a specific context. The context can desc...
Contextual skill models enable robot to generalize parameterized skills for a range of task paramete...
CMA-ES is one of the most popular stochastic search algorithms. It performs favourably in many task...
International audienceWe derive a stochastic search procedure for parameter optimization from two fi...
International audienceContextual bandit algorithms are essential for solving many real-world interac...
In visual search, detection of a target is faster when a layout of nontarget items is repeatedly enc...