In recent years, Multi-Agent Path Finding (MAPF) has attracted attention from the fields of both Operations Research (OR) and Reinforcement Learning (RL). However, in the 2021 Flatland3 Challenge, a competition on MAPF, the best RL method scored only 27.9, far less than the best OR method. This paper proposes a new RL solution to Flatland3 Challenge, which scores 125.3, several times higher than the best RL solution before. We creatively apply a novel network architecture, TreeLSTM, to MAPF in our solution. Together with several other RL techniques, including reward shaping, multiple-phase training, and centralized control, our solution is comparable to the top 2-3 OR methods.Comment: Appear in AAAI23-MAP
Reinforcement learning tree-based planning methods have been gaining popularity in the last few year...
We tackle the problem of cooperative visual exploration where multiple agents need to jointly explor...
In this work, we consider the Multi-Agent Pickup-and-Delivery (MAPD) problem, where agents constantl...
Multi-Agent Path Finding (MAPF) is the combinatorial problem of finding collision-free paths for mul...
Multi-Agent Path Finding (MAPF) is the combinatorial problem of finding collision-free paths for mul...
The multi-agent path finding (MAPF) problem is a generalization of the single-agent path finding pro...
This paper addresses the challenges of real-time, large-scale, and near-optimal multi-agent pathfind...
Multi-Agent Pathfinding (MAPF) is a problem in which the goal is to plan paths for distinct agents w...
Multi-Agent Path Finding (MAPF) has been widely studied in the AI community. For example, Conflict-B...
Thesis (MEng)--Stellenbosch University, 2021.ENGLISH ABSTRACT: Navigation systems are becoming large...
Modern optimal multi-agent path finding (MAPF) algorithms can scale to solve problems with hundreds ...
A multi-agent path finding (MAPF) problem is concerned with finding paths for multiple agents such t...
Multi-Agent Path Finding (MAPF) is the computational problem of constructing collision-free paths fo...
We propose a novel complete algorithm for multi-agent pathfinding (MAPF) called lazy constraints add...
Multi Agent Path Finding (MAPF) is widely needed to coordinate real-world robotic systems. New appro...
Reinforcement learning tree-based planning methods have been gaining popularity in the last few year...
We tackle the problem of cooperative visual exploration where multiple agents need to jointly explor...
In this work, we consider the Multi-Agent Pickup-and-Delivery (MAPD) problem, where agents constantl...
Multi-Agent Path Finding (MAPF) is the combinatorial problem of finding collision-free paths for mul...
Multi-Agent Path Finding (MAPF) is the combinatorial problem of finding collision-free paths for mul...
The multi-agent path finding (MAPF) problem is a generalization of the single-agent path finding pro...
This paper addresses the challenges of real-time, large-scale, and near-optimal multi-agent pathfind...
Multi-Agent Pathfinding (MAPF) is a problem in which the goal is to plan paths for distinct agents w...
Multi-Agent Path Finding (MAPF) has been widely studied in the AI community. For example, Conflict-B...
Thesis (MEng)--Stellenbosch University, 2021.ENGLISH ABSTRACT: Navigation systems are becoming large...
Modern optimal multi-agent path finding (MAPF) algorithms can scale to solve problems with hundreds ...
A multi-agent path finding (MAPF) problem is concerned with finding paths for multiple agents such t...
Multi-Agent Path Finding (MAPF) is the computational problem of constructing collision-free paths fo...
We propose a novel complete algorithm for multi-agent pathfinding (MAPF) called lazy constraints add...
Multi Agent Path Finding (MAPF) is widely needed to coordinate real-world robotic systems. New appro...
Reinforcement learning tree-based planning methods have been gaining popularity in the last few year...
We tackle the problem of cooperative visual exploration where multiple agents need to jointly explor...
In this work, we consider the Multi-Agent Pickup-and-Delivery (MAPD) problem, where agents constantl...