Reinforcement learning problems are often phrased in terms of Markov decision processes (MDPs). In this thesis we go beyond MDPs and consider reinforcement learning in environments that are non-Markovian, non-ergodic and only partially observable. Our focus is not on practical algorithms, but rather on the fundamental underlying problems: How do we balance exploration and exploitation? How do we explore optimally? When is an agent optimal? We follow the nonparametric realizable paradigm: we assume the data is drawn from an unknown source that belongs to a known countable class of candidates. First, we consider the passive (sequence prediction) setting, learning from data that is n...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
Artificial general intelligence aims to create agents capable of learning to solve arbitrary intere...
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
In reinforcement learning the task for an agent is to attain the best possible asymptotic reward wh...
The quintessential model-based reinforcement-learning agent iteratively refines its estimates or pri...
In reinforcement learning the task for an agent is to attain the best possible asymptotic reward wh...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
This paper introduces a principled approach for the design of a scalable general reinforcement lear...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
Making intelligent decisions from incomplete information is critical in many applications: for examp...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
We investigate learning in a setting where each period a population has to choose between two action...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
Artificial general intelligence aims to create agents capable of learning to solve arbitrary intere...
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
In reinforcement learning the task for an agent is to attain the best possible asymptotic reward wh...
The quintessential model-based reinforcement-learning agent iteratively refines its estimates or pri...
In reinforcement learning the task for an agent is to attain the best possible asymptotic reward wh...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
This paper introduces a principled approach for the design of a scalable general reinforcement lear...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
Making intelligent decisions from incomplete information is critical in many applications: for examp...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
We investigate learning in a setting where each period a population has to choose between two action...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
Artificial general intelligence aims to create agents capable of learning to solve arbitrary intere...