Reinforcement learning deals with the problem of sequential decision making in uncertain stochastic environments. In this thesis I deal with agents who attempt to solve the reinforcement learning problem online and in real-time. This presents experimental challenges for which I introduce novel kernelised algorithms. Kernel algorithms are very useful in reinforcement learning settings as they enable learning in situations where a very high-dimensional or hand engineered feature vector would otherwise be required. Furthermore, I attempt to address the theoretical challenges which arise from online on-policy algorithms, for which I introduce a type of analysis which is novel (and useful) to reinforcement learning in its lack of restrictive ass...
In this paper, we provide two new stable online algorithms for the problem of prediction in reinforc...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
In this study, we extend the framework of semiparametric statistical inference introduced recently t...
We introduce and empirically evaluate two novel online gradient-based reinforcement learning algorit...
Reinforcement learning is a general computational framework for learning sequential decision strate...
Reinforcement learning (RL) is a computational framework for learning sequential decision strategies...
In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stoc...
Kernel-based reinforcement learning (KBRL) stands out among approximate re-inforcement learning algo...
We consider the regret minimization problem in reinforcement learning (RL) in the episodic setting. ...
Reinforcement learning (RL) focuses on an essential aspect of intelligent behavior – how an agent ca...
Letter: Communicated by Masa-aki Sato.Function approximation in online, incremental, reinforcement l...
Abstract. We present a kernel-based approach to reinforcement learning that overcomes the stability ...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Reinforcement learning is often done using parameterized function approximators to store value funct...
In this paper, we provide two new stable online algorithms for the problem of prediction in reinforc...
In this paper, we provide two new stable online algorithms for the problem of prediction in reinforc...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
In this study, we extend the framework of semiparametric statistical inference introduced recently t...
We introduce and empirically evaluate two novel online gradient-based reinforcement learning algorit...
Reinforcement learning is a general computational framework for learning sequential decision strate...
Reinforcement learning (RL) is a computational framework for learning sequential decision strategies...
In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stoc...
Kernel-based reinforcement learning (KBRL) stands out among approximate re-inforcement learning algo...
We consider the regret minimization problem in reinforcement learning (RL) in the episodic setting. ...
Reinforcement learning (RL) focuses on an essential aspect of intelligent behavior – how an agent ca...
Letter: Communicated by Masa-aki Sato.Function approximation in online, incremental, reinforcement l...
Abstract. We present a kernel-based approach to reinforcement learning that overcomes the stability ...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Reinforcement learning is often done using parameterized function approximators to store value funct...
In this paper, we provide two new stable online algorithms for the problem of prediction in reinforc...
In this paper, we provide two new stable online algorithms for the problem of prediction in reinforc...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
In this study, we extend the framework of semiparametric statistical inference introduced recently t...