This thesis explores structured, reward-based behaviour in artificial agents and in animals. In Part I we investigate how reinforcement learning agents can learn to cooperate. Drawing inspiration from the hierarchical organisation of human societies, we propose the framework of Feudal Multi-agent Hierarchies (FMH), in which coordination of many agents is facilitated by a manager agent. We outline the structure of FMH and demonstrate its potential for decentralised learning and control. We show that, given an adequate set of subgoals from which to choose, FMH performs, and particularly scales, substantially better than cooperative approaches that use shared rewards. We next investigate training FMH in simulation to solve a complex informatio...
Multiple agents have become increasingly utilized in various fields for both physical robots and sof...
This dissertation is concerned with the autonomous learning of behavioral models for sequential deci...
What is the role of real-time control and learning in the formation of social conventions? To answer...
We investigate how reinforcement learning agents can learn to cooperate. Drawing inspiration from hu...
“The original publication is available at www.springerlink.com” Copyright SpringerHierarchical struc...
In order to verify models of collective behaviors of animals, robots could be manipulated to impleme...
Brickworld is a simulated environment which has been developed as a testbed for learning and plannin...
The acquisition of hierarchies of reusable skills is one of the distinguishing characteristics of hu...
Many animal and human societies exhibit hierarchical structures with different degrees of steepness....
In our everyday lives, we must learn and utilize context-specific information to inform our decision...
The goal of the thesis is to study the role of the reward signal in deep reinforcement learning. The...
This paper presents a novel model of reinforcement learning agents. A feature of our learning agent ...
Many species are able to learn to associate behaviours with rewards as this gives fitness advantages...
For decades, neuroscientists and psychologists have observed that animal performance on spatial navi...
Team training in complex domains often requires a substantial number of resources, e.g. vehicles, ma...
Multiple agents have become increasingly utilized in various fields for both physical robots and sof...
This dissertation is concerned with the autonomous learning of behavioral models for sequential deci...
What is the role of real-time control and learning in the formation of social conventions? To answer...
We investigate how reinforcement learning agents can learn to cooperate. Drawing inspiration from hu...
“The original publication is available at www.springerlink.com” Copyright SpringerHierarchical struc...
In order to verify models of collective behaviors of animals, robots could be manipulated to impleme...
Brickworld is a simulated environment which has been developed as a testbed for learning and plannin...
The acquisition of hierarchies of reusable skills is one of the distinguishing characteristics of hu...
Many animal and human societies exhibit hierarchical structures with different degrees of steepness....
In our everyday lives, we must learn and utilize context-specific information to inform our decision...
The goal of the thesis is to study the role of the reward signal in deep reinforcement learning. The...
This paper presents a novel model of reinforcement learning agents. A feature of our learning agent ...
Many species are able to learn to associate behaviours with rewards as this gives fitness advantages...
For decades, neuroscientists and psychologists have observed that animal performance on spatial navi...
Team training in complex domains often requires a substantial number of resources, e.g. vehicles, ma...
Multiple agents have become increasingly utilized in various fields for both physical robots and sof...
This dissertation is concerned with the autonomous learning of behavioral models for sequential deci...
What is the role of real-time control and learning in the formation of social conventions? To answer...