It is cooperation that essentially differentiates multi-agent systems (MASs) from single-agent intelligence. In realistic MAS applications such as RoboCup, repeated work has shown that traditional machine learning (ML) approaches have difficulty mapping directly from cooperative behaviours to actuator outputs. To overcome this problem, vertical layered architectures are commonly used to break cooperation down into behavioural layers; ML has then been used to generate different low-level skills, and a planning mechanism added to create high-level cooperation. We propose a novel method called Policy Search Planning (PSP), in which Policy Search is used to find an optimal policy for selecting plans from a plan pool. PSP extends an existing gra...
Bayesian inference is an appealing approach for leveraging prior knowledge in reinforcement learning...
Reinforcement learning tree-based planning methods have been gaining popularity in the last few year...
Multi-Agent Path Finding (MAPF) is the computational problem of constructing collision-free paths fo...
Abstract. OxBlue2009 (2D) is a robot football team for RoboCup 2D simulation. In this paper, the dec...
Recent superhuman results in games have largely been achieved in a variety of zero-sum settings, suc...
The cooperate behavior that emerges from the interactions among simple multi-agent robots along with...
Abstract: In Reinforcement Learning, Unsupervised Skill Discovery tackles the learning of several po...
Autonomous unmanned vehicles (UxVs) can be useful in many scenarios including disaster relief, prod...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
Robots will play a crucial role in the future and need to work as a team in increasingly more comple...
When autonomous agents attempt to coordinate action, it is often necessary that they reach some kind...
Trabajo presentado en el International Conference on Robotics and Automation (ICRA), celebrado de fo...
© Copyright 2017, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rig...
One of the most difficult unsolved tasks in the field of multi-agent modeling is to discover common ...
We consider the problem of learning skills that are versatilely applicable. One popular approach for...
Bayesian inference is an appealing approach for leveraging prior knowledge in reinforcement learning...
Reinforcement learning tree-based planning methods have been gaining popularity in the last few year...
Multi-Agent Path Finding (MAPF) is the computational problem of constructing collision-free paths fo...
Abstract. OxBlue2009 (2D) is a robot football team for RoboCup 2D simulation. In this paper, the dec...
Recent superhuman results in games have largely been achieved in a variety of zero-sum settings, suc...
The cooperate behavior that emerges from the interactions among simple multi-agent robots along with...
Abstract: In Reinforcement Learning, Unsupervised Skill Discovery tackles the learning of several po...
Autonomous unmanned vehicles (UxVs) can be useful in many scenarios including disaster relief, prod...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
Robots will play a crucial role in the future and need to work as a team in increasingly more comple...
When autonomous agents attempt to coordinate action, it is often necessary that they reach some kind...
Trabajo presentado en el International Conference on Robotics and Automation (ICRA), celebrado de fo...
© Copyright 2017, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rig...
One of the most difficult unsolved tasks in the field of multi-agent modeling is to discover common ...
We consider the problem of learning skills that are versatilely applicable. One popular approach for...
Bayesian inference is an appealing approach for leveraging prior knowledge in reinforcement learning...
Reinforcement learning tree-based planning methods have been gaining popularity in the last few year...
Multi-Agent Path Finding (MAPF) is the computational problem of constructing collision-free paths fo...