This work addresses the problem of inverse reinforcement learning in Markov decision processes where the decision-making agent is risk-sensitive. In particular, a risk-sensitive reinforcement learning algorithm with convergence guarantees that makes use of coherent risk metrics and models of human decision-making which have their origins in behavioral psychology and economics is presented. The risk-sensitive reinforcement learning algorithm provides the theoretical underpinning for a gradient-based inverse reinforcement learning algorithm that seeks to minimize a loss function defined on the observed behavior. It is shown that the gradient of the loss function with respect to the model parameters is well defined and computable via a contrac...
We address the problem of control in a risk-sensitive reinforcement learning (RL) context via distor...
Various methods for solving the inverse reinforcement learning (IRL) problem have been developed ind...
Keeping risk under control is a primary objective in many critical real-world domains, including fin...
We address the problem of inverse reinforcement learning in Markov decision processes where the agen...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
Classical game-theoretic approaches for multi-agent systems in both the forward policy design proble...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
This paper considers sequential decision making problems under uncertainty, the tradeoff between the...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement Learning (RL) methods provide a solution for decision-making problems under uncertaint...
We derive a family of risk-sensitive reinforcement learning methods for agents, who face sequential ...
We adopt Markov Decision Processes (MDP) to model sequential decision problems, which have the chara...
peer reviewedClassical reinforcement learning (RL) techniques are generally concerned with the desig...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement learning is one of the most promising machine learning techniques to get intelligent b...
We address the problem of control in a risk-sensitive reinforcement learning (RL) context via distor...
Various methods for solving the inverse reinforcement learning (IRL) problem have been developed ind...
Keeping risk under control is a primary objective in many critical real-world domains, including fin...
We address the problem of inverse reinforcement learning in Markov decision processes where the agen...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
Classical game-theoretic approaches for multi-agent systems in both the forward policy design proble...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
This paper considers sequential decision making problems under uncertainty, the tradeoff between the...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement Learning (RL) methods provide a solution for decision-making problems under uncertaint...
We derive a family of risk-sensitive reinforcement learning methods for agents, who face sequential ...
We adopt Markov Decision Processes (MDP) to model sequential decision problems, which have the chara...
peer reviewedClassical reinforcement learning (RL) techniques are generally concerned with the desig...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement learning is one of the most promising machine learning techniques to get intelligent b...
We address the problem of control in a risk-sensitive reinforcement learning (RL) context via distor...
Various methods for solving the inverse reinforcement learning (IRL) problem have been developed ind...
Keeping risk under control is a primary objective in many critical real-world domains, including fin...