My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques by exploiting information in a Bayesian posterior, while also selecting actions by growing an adaptive, sparse lookahead tree. I further augment the approach by considering a new value function approximation strategy for the belief-state Markov decision processes induced by Bayesian learning. Bayesian Reinforcement Learning Imagine a mobile vendor robot (“vendorbot”) loaded with snacks and bustling around a building, learning where to visit to optimize its profit. The robot must choose wisely between selling snacks somewhere far away from its hom
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
Making intelligent decisions from incomplete information is critical in many applications: for examp...
My research attempts to address on-line action selection in reinforcement learning from a Bayesian p...
We present a Bayesian reinforcement learning model with a working memory module which can solve some...
Presentation given at the MAA Southeast Section. Abstract In a Markov Decision Process, an agent mus...
Thesis (Ph.D.)--University of Washington, 2020Informed and robust decision making in the face of unc...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
peer reviewedThis chapter surveys recent lines of work that use Bayesian techniques for reinforcemen...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
We address the problem of Bayesian reinforcement learning using efficient model-based online plannin...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Effectively leveraging model structure in reinforcement learning is a difficult task, but failure to...
Institute of Perception, Action and BehaviourRecently there has been a good deal of interest in usin...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
Making intelligent decisions from incomplete information is critical in many applications: for examp...
My research attempts to address on-line action selection in reinforcement learning from a Bayesian p...
We present a Bayesian reinforcement learning model with a working memory module which can solve some...
Presentation given at the MAA Southeast Section. Abstract In a Markov Decision Process, an agent mus...
Thesis (Ph.D.)--University of Washington, 2020Informed and robust decision making in the face of unc...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
peer reviewedThis chapter surveys recent lines of work that use Bayesian techniques for reinforcemen...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
We address the problem of Bayesian reinforcement learning using efficient model-based online plannin...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Effectively leveraging model structure in reinforcement learning is a difficult task, but failure to...
Institute of Perception, Action and BehaviourRecently there has been a good deal of interest in usin...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
Making intelligent decisions from incomplete information is critical in many applications: for examp...