In the Bayesian reinforcement learning (RL) setting, a prior distribution over the unknown problem parameters -- the rewards and transitions -- is assumed, and a policy that optimizes the (posterior) expected return is sought. A common approximation, which has been recently popularized as meta-RL, is to train the agent on a sample of N problem instances from the prior, with the hope that for large enough N, good generalization behavior to an unseen test instance will be obtained. In this work, we study generalization in Bayesian RL under the probably approximately correct (PAC) framework, using the method of algorithmic stability. Our main contribution is showing that by adding regularization, the optimal policy becomes uniformly stable i...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
We define notions of stability for learning algorithms and show how to use these notions to derive g...
Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of th...
ICML 2019International audienceMany recent successful (deep) reinforcement learning algorithms make ...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
Monte-Carlo planning and Reinforcement Learning (RL) are essential to sequential decision making. Th...
In learning problems, avoiding to overfit the training data is of fundamental importance in order to...
Reinforcement learning (RL) is an important field of research in machine learning that is increasing...
Policy regularization methods such as maximum entropy regularization are widely used in reinforcemen...
We show that convex KL-regularized objective functions are obtained from a PAC-Bayes risk bound when...
This paper introduces a set of algorithms for Monte-Carlo Bayesian reinforcement learning. Firstly, ...
Abstract. In many Reinforcement Learning (RL) domains there is a high cost for generating experience...
We introduce a novel perspective on Bayesian reinforcement learning (RL); whereas existing approache...
Abstract-Reinforcement learning with linear and non-linear function approximation has been studied e...
We consider the problem of supervised learning with convex loss functions and propose a new form of ...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
We define notions of stability for learning algorithms and show how to use these notions to derive g...
Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of th...
ICML 2019International audienceMany recent successful (deep) reinforcement learning algorithms make ...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
Monte-Carlo planning and Reinforcement Learning (RL) are essential to sequential decision making. Th...
In learning problems, avoiding to overfit the training data is of fundamental importance in order to...
Reinforcement learning (RL) is an important field of research in machine learning that is increasing...
Policy regularization methods such as maximum entropy regularization are widely used in reinforcemen...
We show that convex KL-regularized objective functions are obtained from a PAC-Bayes risk bound when...
This paper introduces a set of algorithms for Monte-Carlo Bayesian reinforcement learning. Firstly, ...
Abstract. In many Reinforcement Learning (RL) domains there is a high cost for generating experience...
We introduce a novel perspective on Bayesian reinforcement learning (RL); whereas existing approache...
Abstract-Reinforcement learning with linear and non-linear function approximation has been studied e...
We consider the problem of supervised learning with convex loss functions and propose a new form of ...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
We define notions of stability for learning algorithms and show how to use these notions to derive g...
Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of th...