Three major challenges in reinforcement learning are the complex dynamical systems with large state spaces, the costly data acquisition processes, and the deviation of real-world dynamics from the training environment deployment. To overcome these issues, we study distributionally robust Markov decision processes with continuous state spaces under the widely used Kullback-Leibler, chi-square, and total variation uncertainty sets. We propose a model-based approach that utilizes Gaussian Processes and the maximum variance reduction algorithm to efficiently learn multi-output nominal transition dynamics, leveraging access to a generative model (i.e., simulator). We further demonstrate the statistical sample complexity of the proposed method fo...
International audienceWe consider the problem of learning the optimal action-value function in disco...
Reinforcement learning (RL) has become a highly successful framework for learning in Markov decision...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are r...
In real scenarios, state observations that an agent observes may contain measurement errors or adver...
This paper concerns the central issues of model robustness and sample efficiency in offline reinforc...
The curse of dimensionality is a widely known issue in reinforcement learning (RL). In the tabular s...
Applying the reinforcement learning methodology to domains that involve risky decisions like medicin...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
International audienceWe consider the problem of learning the optimal action-value function in the d...
This book examines Gaussian processes in both model-based reinforcement learning (RL) and inference ...
Distributional Reinforcement Learning (RL) maintains the entire probability distribution of the rewa...
Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, espe...
Control of nonlinear systems on continuous domains is a challenging task for various reasons. For ro...
International audienceWe consider the problem of learning the optimal action-value function in disco...
Reinforcement learning (RL) has become a highly successful framework for learning in Markov decision...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are r...
In real scenarios, state observations that an agent observes may contain measurement errors or adver...
This paper concerns the central issues of model robustness and sample efficiency in offline reinforc...
The curse of dimensionality is a widely known issue in reinforcement learning (RL). In the tabular s...
Applying the reinforcement learning methodology to domains that involve risky decisions like medicin...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
Finding an optimal policy in a reinforcement learning (RL) framework with continuous state and actio...
International audienceWe consider the problem of learning the optimal action-value function in the d...
This book examines Gaussian processes in both model-based reinforcement learning (RL) and inference ...
Distributional Reinforcement Learning (RL) maintains the entire probability distribution of the rewa...
Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, espe...
Control of nonlinear systems on continuous domains is a challenging task for various reasons. For ro...
International audienceWe consider the problem of learning the optimal action-value function in disco...
Reinforcement learning (RL) has become a highly successful framework for learning in Markov decision...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...