This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the real-world, with neither ground truth nor rewards. The proposed approach is based on preference-based policy learning: Iteratively, the robot demonstrates a few policies, is informed of the expert's preferences about the demonstrated policies, constructs a utility function compatible with all expert preferences, uses it in a self-training phase, and demonstrates in the next iteration a new policy. While in previous work, the new policy was one maximizing the current utility function, this paper uses active ranking to select the most informative policy (Vi-appiani and Boutilier 2010). The challenge is the following. The policy return estima...
In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control...
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving mo...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
Our goal is to efficiently learn reward functions encoding a human's preferences for how a dynamical...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
The field of deep reinforcement learning has seen major successes recently, achieving superhuman per...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control...
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving mo...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
Our goal is to efficiently learn reward functions encoding a human's preferences for how a dynamical...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
The field of deep reinforcement learning has seen major successes recently, achieving superhuman per...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control...
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving mo...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...