In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control parameters) and at high-level (e.g., the behaviors) determine the quality of the robot performance. Thus, for many tasks, robots require fine tuning of the parameters, in the implementation of behaviors and basic control actions, as well as in strategic decisional processes. In recent years, machine learning techniques have been used to find optimal parameter sets for different behaviors. However, a drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and opt...
Abstract — Slow convergence is a major problem for policy gradient methods. It is a consequence of t...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
We present an in-depth survey of policy gradient methods as they are used in the machine learning co...
International audienceIn real-world robotic applications, many factors, both at low-level (e.g., vis...
In real-world robotic applications, many factors, both at low level (e.g., vision, motion control an...
In robotics, elementary behaviour patterns often tackle control theoretic problems. Because of incom...
Policy gradient algorithms have shown consider-able recent success in solving high-dimensional seque...
The acquisition and improvement of motor skills and control policies for robotics from trial and err...
This electronic version was submitted by the student author. The certified thesis is available in th...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Abstract — The aquisition and improvement of motor skills and control policies for robotics from tri...
Reinforcement learning is a model-free technique to solve decision-making problems by learning the b...
International audienceIn humanoid robotic soccer, many factors, both at low-level (e.g., vision and ...
Abstract — Robot programmers can often quickly program a robot to approximately execute a task under...
This work presents a Deep Reinforcement Learning algorithm to control a differentially driven mobile...
Abstract — Slow convergence is a major problem for policy gradient methods. It is a consequence of t...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
We present an in-depth survey of policy gradient methods as they are used in the machine learning co...
International audienceIn real-world robotic applications, many factors, both at low-level (e.g., vis...
In real-world robotic applications, many factors, both at low level (e.g., vision, motion control an...
In robotics, elementary behaviour patterns often tackle control theoretic problems. Because of incom...
Policy gradient algorithms have shown consider-able recent success in solving high-dimensional seque...
The acquisition and improvement of motor skills and control policies for robotics from trial and err...
This electronic version was submitted by the student author. The certified thesis is available in th...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Abstract — The aquisition and improvement of motor skills and control policies for robotics from tri...
Reinforcement learning is a model-free technique to solve decision-making problems by learning the b...
International audienceIn humanoid robotic soccer, many factors, both at low-level (e.g., vision and ...
Abstract — Robot programmers can often quickly program a robot to approximately execute a task under...
This work presents a Deep Reinforcement Learning algorithm to control a differentially driven mobile...
Abstract — Slow convergence is a major problem for policy gradient methods. It is a consequence of t...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
We present an in-depth survey of policy gradient methods as they are used in the machine learning co...