Existing offline reinforcement learning (RL) methods face a few major challenges, particularly the distributional shift between the learned policy and the behavior policy. Offline Meta-RL is emerging as a promising approach to address these challenges, aiming to learn an informative meta-policy from a collection of tasks. Nevertheless, as shown in our empirical studies, offline Meta-RL could be outperformed by offline single-task RL methods on tasks with good quality of datasets, indicating that a right balance has to be delicately calibrated between "exploring" the out-of-distribution state-actions by following the meta-policy and "exploiting" the offline dataset by staying close to the behavior policy. Motivated by such empirical analysis...
International audienceOffline Reinforcement Learning (RL) aims at learning an optimal control from a...
Offline reinforcement learning (RL) aims to learn policy from the passively collected offline datase...
Meta-reinforcement learning enables artificial agents to learn from related training tasks and adapt...
Offline reinforcement learning (RL) extends the paradigm of classical RL algorithms to purely learni...
Policy constraint methods to offline reinforcement learning (RL) typically utilize parameterization ...
Meta-reinforcement learning (RL) methods can meta-train policies that adapt to new tasks with orders...
In this dissertation we develop new methodologies and frameworks to address challenges in offline re...
Offline reinforcement learning -- learning a policy from a batch of data -- is known to be hard for ...
Existing offline reinforcement learning (RL) algorithms typically assume that training data is eithe...
Offline reinforcement learning enables learning from a fixed dataset, without further interactions w...
We present a model-based offline reinforcement learning policy performance lower bound that explicit...
Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is imprac...
Offline Reinforcement Learning (RL) aims at learning an optimal control from a fixed dataset, withou...
Offline reinforcement learning (RL) methods can generally be categorized into two types: RL-based an...
The ability to discover optimal behaviour from fixed data sets has the potential to transfer the suc...
International audienceOffline Reinforcement Learning (RL) aims at learning an optimal control from a...
Offline reinforcement learning (RL) aims to learn policy from the passively collected offline datase...
Meta-reinforcement learning enables artificial agents to learn from related training tasks and adapt...
Offline reinforcement learning (RL) extends the paradigm of classical RL algorithms to purely learni...
Policy constraint methods to offline reinforcement learning (RL) typically utilize parameterization ...
Meta-reinforcement learning (RL) methods can meta-train policies that adapt to new tasks with orders...
In this dissertation we develop new methodologies and frameworks to address challenges in offline re...
Offline reinforcement learning -- learning a policy from a batch of data -- is known to be hard for ...
Existing offline reinforcement learning (RL) algorithms typically assume that training data is eithe...
Offline reinforcement learning enables learning from a fixed dataset, without further interactions w...
We present a model-based offline reinforcement learning policy performance lower bound that explicit...
Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is imprac...
Offline Reinforcement Learning (RL) aims at learning an optimal control from a fixed dataset, withou...
Offline reinforcement learning (RL) methods can generally be categorized into two types: RL-based an...
The ability to discover optimal behaviour from fixed data sets has the potential to transfer the suc...
International audienceOffline Reinforcement Learning (RL) aims at learning an optimal control from a...
Offline reinforcement learning (RL) aims to learn policy from the passively collected offline datase...
Meta-reinforcement learning enables artificial agents to learn from related training tasks and adapt...