Inverse reinforcement learning has proved its ability to explain state-action trajectories of expert agents by recovering their underlying reward functions in increasingly challenging environments. Recent advances in adversarial learning have allowed extending inverse RL to applications with non-stationary environment dynamics unknown to the agents, arbitrary structures of reward functions and improved handling of the ambiguities inherent to the ill-posed nature of inverse RL. This is particularly relevant in real time applications on stochastic environments involving risk, like volatile financial markets. Moreover, recent work on simulation of complex environments enable learning algorithms to engage with real market data through si...
Inverse Reinforcement Learning (IRL) learns an optimal policy, given some expert demonstrations, thu...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
Reinforcement learning has shown promise in learning policies that can solve complex problems. Howev...
Inverse reinforcement learning has proved its ability to explain state-action trajectories of expert...
Multi-agent learning is a promising method to simulate aggregate competitive behaviour in finance. L...
Mean field games (MFGs) provide a mathematically tractable framework for modelling large-scale multi...
We show that adversarial reinforcement learning (ARL) can be used to produce market marking agents t...
The task of learning a reward function from expert demonstrations suffers from high sample complexit...
Reinforcement learning is one of the most promising machine learning techniques to get intelligent b...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
This work addresses the problem of inverse reinforcement learning in Markov decision processes where...
International audienceInverse Reinforcement Learning (IRL) is an effective approach to recover a rew...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
We evaluate the robustness of reward functions learned with IRL, when transferred to similar tasks. ...
The application of reinforcement learning (RL) to algorithmic trading is, in many ways, a perfect ma...
Inverse Reinforcement Learning (IRL) learns an optimal policy, given some expert demonstrations, thu...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
Reinforcement learning has shown promise in learning policies that can solve complex problems. Howev...
Inverse reinforcement learning has proved its ability to explain state-action trajectories of expert...
Multi-agent learning is a promising method to simulate aggregate competitive behaviour in finance. L...
Mean field games (MFGs) provide a mathematically tractable framework for modelling large-scale multi...
We show that adversarial reinforcement learning (ARL) can be used to produce market marking agents t...
The task of learning a reward function from expert demonstrations suffers from high sample complexit...
Reinforcement learning is one of the most promising machine learning techniques to get intelligent b...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
This work addresses the problem of inverse reinforcement learning in Markov decision processes where...
International audienceInverse Reinforcement Learning (IRL) is an effective approach to recover a rew...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
We evaluate the robustness of reward functions learned with IRL, when transferred to similar tasks. ...
The application of reinforcement learning (RL) to algorithmic trading is, in many ways, a perfect ma...
Inverse Reinforcement Learning (IRL) learns an optimal policy, given some expert demonstrations, thu...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
Reinforcement learning has shown promise in learning policies that can solve complex problems. Howev...