Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chosen reward function. However, designing such a reward function often requires a lot of task- specific prior knowledge. The designer needs to consider different objectives that do not only influence the learned behavior but also the learning progress. To alleviate these issues, preference-based reinforcement learning algorithms (PbRL) have been proposed that can directly learn from an expert's preferences instead of a hand-designed numeric reward. PbRL has gained traction in recent years due to its ability to resolve the reward shaping problem, its ability to learn from non numeric rewards and the possibility to reduce the dependence on expert k...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward ...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
Common reinforcement learning algorithms assume access to a numeric feedback signal. The numeric fee...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
In this paper, we address the issue of fairness in preference-based reinforcement learning (PbRL) in...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
We consider the problem of preference based reinforcement learning (PbRL), where, unlike traditional...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward ...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
Common reinforcement learning algorithms assume access to a numeric feedback signal. The numeric fee...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
In this paper, we address the issue of fairness in preference-based reinforcement learning (PbRL) in...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
We consider the problem of preference based reinforcement learning (PbRL), where, unlike traditional...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...