Despite the recent advances of deep reinforcement learning (DRL), agents trained by DRL tend to be brittle and sensitive to the training environment, especially in the multi-agent scenarios. In the multi-agent setting, a DRL agent’s policy can easily get stuck in a poor local optima w.r.t. its training partners – the learned policy may be only locally optimal to other agents’ current policies. In this paper, we focus on the problem of training robust DRL agents with continuous actions in the multi-agent learning setting so that the trained agents can still generalize when its opponents’ policies alter. To tackle this problem, we proposed a new algorithm, MiniMax Multi-agent Deep Deterministic Policy Gradient (M3DDPG) with the following cont...
Recent advances in Multi-agent Reinforcement Learning (MARL) have made it possible to implement vari...
This paper deals with distributed reinforcement learning problems with safety constraints. In partic...
Reinforcement learning has recently become a promising area of machine learning with significant ach...
In this paper, a novel Multi-agent Reinforcement Learning (MARL) approach, Multi-Agent Continuous Dy...
This paper demonstrates the need to develop more suitable decentralized reinforcement learning metho...
Machine Learning (ML) has been a remarkable success in the last few years, which Reinforcement Learn...
A plethora of real world problems, such as the control of autonomous vehicles and drones, packet de...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
Many state-of-the-art cooperative multi-agent reinforcement learning (MARL) approaches, such as MADD...
A pursuit–evasion game is a classical maneuver confrontation problem in the multi-agent systems (MAS...
This project was motivated by seeking an AI method towards Artificial General Intelligence (AGI), th...
Deep deterministic policy gradient algorithm operating over continuous space of actions has attracte...
Reinforcement learning has been applied to solve several real world challenging problems, from robot...
Deep Reinforcement Learning has achieved a plenty of breakthroughs in the past decade. Motivated by ...
We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative ...
Recent advances in Multi-agent Reinforcement Learning (MARL) have made it possible to implement vari...
This paper deals with distributed reinforcement learning problems with safety constraints. In partic...
Reinforcement learning has recently become a promising area of machine learning with significant ach...
In this paper, a novel Multi-agent Reinforcement Learning (MARL) approach, Multi-Agent Continuous Dy...
This paper demonstrates the need to develop more suitable decentralized reinforcement learning metho...
Machine Learning (ML) has been a remarkable success in the last few years, which Reinforcement Learn...
A plethora of real world problems, such as the control of autonomous vehicles and drones, packet de...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
Many state-of-the-art cooperative multi-agent reinforcement learning (MARL) approaches, such as MADD...
A pursuit–evasion game is a classical maneuver confrontation problem in the multi-agent systems (MAS...
This project was motivated by seeking an AI method towards Artificial General Intelligence (AGI), th...
Deep deterministic policy gradient algorithm operating over continuous space of actions has attracte...
Reinforcement learning has been applied to solve several real world challenging problems, from robot...
Deep Reinforcement Learning has achieved a plenty of breakthroughs in the past decade. Motivated by ...
We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative ...
Recent advances in Multi-agent Reinforcement Learning (MARL) have made it possible to implement vari...
This paper deals with distributed reinforcement learning problems with safety constraints. In partic...
Reinforcement learning has recently become a promising area of machine learning with significant ach...