Human-in-the-loop reinforcement learning (RL) methods actively integrate human knowledge to create reward functions for various robotic tasks. Learning from preferences shows promise as alleviates the requirement of demonstrations by querying humans on state-action sequences. However, the limited granularity of sequence-based approaches complicates temporal credit assignment. The amount of human querying is contingent on query quality, as redundant queries result in excessive human involvement. This paper addresses the often-overlooked aspect of query selection, which is closely related to active learning (AL). We propose a novel query selection approach that leverages variational autoencoder (VAE) representations of state sequences. In thi...
Pool-based sampling Kernel function a b s t r a c t Active learning has received great interests fro...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
| openaire: EC/H2020/637991/EU//COMPUTEDThis paper investigates Active Robot Learning strategies tha...
Our goal is to efficiently learn reward functions encoding a human's preferences for how a dynamical...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward ...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
Abstract—While reward functions are an essential component of many robot learning methods, defining ...
This paper investigates how to make improved action selection for online policy learning in robotic ...
Most active learning approaches select either informative or representative unla-beled instances to ...
With the goal of having robots learn new skills after deployment, we propose an active learning fram...
The use of robotic systems outside the branch of tasks currently common in industry requires the dev...
Mobile robots are increasingly being employed for performing complex tasks in dynamic environments. ...
Pool-based sampling Kernel function a b s t r a c t Active learning has received great interests fro...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
| openaire: EC/H2020/637991/EU//COMPUTEDThis paper investigates Active Robot Learning strategies tha...
Our goal is to efficiently learn reward functions encoding a human's preferences for how a dynamical...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward ...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
Abstract—While reward functions are an essential component of many robot learning methods, defining ...
This paper investigates how to make improved action selection for online policy learning in robotic ...
Most active learning approaches select either informative or representative unla-beled instances to ...
With the goal of having robots learn new skills after deployment, we propose an active learning fram...
The use of robotic systems outside the branch of tasks currently common in industry requires the dev...
Mobile robots are increasingly being employed for performing complex tasks in dynamic environments. ...
Pool-based sampling Kernel function a b s t r a c t Active learning has received great interests fro...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
| openaire: EC/H2020/637991/EU//COMPUTEDThis paper investigates Active Robot Learning strategies tha...