Offline reinforcement learning -- learning a policy from a batch of data -- is known to be hard for general MDPs. These results motivate the need to look at specific classes of MDPs where offline reinforcement learning might be feasible. In this work, we explore a restricted class of MDPs to obtain guarantees for offline reinforcement learning. The key property, which we call Action Impact Regularity (AIR), is that actions primarily impact a part of the state (an endogenous component) with limited impact on the remaining part of the state (an exogenous component). AIR is a strong assumption, but it nonetheless holds in a number of real-world domains including financial markets. We discuss algorithms that exploits the AIR property, and provi...
International audienceOffline Reinforcement Learning (RL) aims at learning an optimal control from a...
Offline reinforcement learning aims to utilize datasets of previously gathered environment-action in...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...
Offline reinforcement learning enables learning from a fixed dataset, without further interactions w...
Offline reinforcement learning (RL) struggles in environments with rich and noisy inputs, where the ...
In this dissertation we develop new methodologies and frameworks to address challenges in offline re...
We present a model-based offline reinforcement learning policy performance lower bound that explicit...
Offline reinforcement learning (RL) extends the paradigm of classical RL algorithms to purely learni...
Policy constraint methods to offline reinforcement learning (RL) typically utilize parameterization ...
Offline Reinforcement Learning (RL) aims at learning an optimal control from a fixed dataset, withou...
Offline reinforcement learning (RL) aims to learn the optimal policy from a pre-collected dataset wi...
We study the problem of safe offline reinforcement learning (RL), the goal is to learn a policy that...
The application of Reinforcement Learning (RL) in real world environments can be expensive or risky ...
Offline reinforcement learning algorithms still lack trust in practice due to the risk that the lear...
Sample-efficient offline reinforcement learning (RL) with linear function approximation has been stu...
International audienceOffline Reinforcement Learning (RL) aims at learning an optimal control from a...
Offline reinforcement learning aims to utilize datasets of previously gathered environment-action in...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...
Offline reinforcement learning enables learning from a fixed dataset, without further interactions w...
Offline reinforcement learning (RL) struggles in environments with rich and noisy inputs, where the ...
In this dissertation we develop new methodologies and frameworks to address challenges in offline re...
We present a model-based offline reinforcement learning policy performance lower bound that explicit...
Offline reinforcement learning (RL) extends the paradigm of classical RL algorithms to purely learni...
Policy constraint methods to offline reinforcement learning (RL) typically utilize parameterization ...
Offline Reinforcement Learning (RL) aims at learning an optimal control from a fixed dataset, withou...
Offline reinforcement learning (RL) aims to learn the optimal policy from a pre-collected dataset wi...
We study the problem of safe offline reinforcement learning (RL), the goal is to learn a policy that...
The application of Reinforcement Learning (RL) in real world environments can be expensive or risky ...
Offline reinforcement learning algorithms still lack trust in practice due to the risk that the lear...
Sample-efficient offline reinforcement learning (RL) with linear function approximation has been stu...
International audienceOffline Reinforcement Learning (RL) aims at learning an optimal control from a...
Offline reinforcement learning aims to utilize datasets of previously gathered environment-action in...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...