Multi-agent reinforcement learning (MRL) is a growing area of research. What makes it particularly challenging is that multiple learners render each other’s environments non-stationary. In addition to adapting their behaviors to other learning agents, online learners must also provide assurances about their online performance in order to promote user trust of adaptive agent systems deployed in real world applications. In this article, instead of developing new algorithms with such assurances, we study the question of safety in online performance of some existing MRL algorithms. We identify the key notion of reactivity of a learner by analyzing how an algorithm (PHC-Exploiter), designed to exploit some simpler opponents, can itself be exploi...
Reinforcement learning agents interacting with a complex environment like the real world are unlikel...
We expose the danger of reward poisoning in offline multi-agent reinforcement learning (MARL), where...
This study presents a unified resilient model-free reinforcement learning (RL) based distributed con...
Multi-agent reinforcement learning (MRL) is a growing area of research. What makes it particularly c...
The aim of multi-agent reinforcement learning systems is to provide interacting agents with the abil...
We present an approach to safely reduce the communication required between agents in a Multi-Agent R...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...
Despite increasing deployment of agent technologies in several business and industry domains, user c...
Reinforcement learning, especially deep reinforcement learning, has made many advances in the last d...
Despite increasing deployment of agent technologies in several business and industry domains, user c...
Many state-of-the-art cooperative multi-agent reinforcement learning (MARL) approaches, such as MADD...
Typically in reinforcement learning, agents are trained and evaluated on the same environment. Conse...
Training reinforcement learning agents in real-world environments is costly, particularly for safety...
Reinforcement learning in complex environments may require supervision to prevent the agent from att...
Reinforcement learning (RL) is a general method for agents to learn optimal control policies through...
Reinforcement learning agents interacting with a complex environment like the real world are unlikel...
We expose the danger of reward poisoning in offline multi-agent reinforcement learning (MARL), where...
This study presents a unified resilient model-free reinforcement learning (RL) based distributed con...
Multi-agent reinforcement learning (MRL) is a growing area of research. What makes it particularly c...
The aim of multi-agent reinforcement learning systems is to provide interacting agents with the abil...
We present an approach to safely reduce the communication required between agents in a Multi-Agent R...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...
Despite increasing deployment of agent technologies in several business and industry domains, user c...
Reinforcement learning, especially deep reinforcement learning, has made many advances in the last d...
Despite increasing deployment of agent technologies in several business and industry domains, user c...
Many state-of-the-art cooperative multi-agent reinforcement learning (MARL) approaches, such as MADD...
Typically in reinforcement learning, agents are trained and evaluated on the same environment. Conse...
Training reinforcement learning agents in real-world environments is costly, particularly for safety...
Reinforcement learning in complex environments may require supervision to prevent the agent from att...
Reinforcement learning (RL) is a general method for agents to learn optimal control policies through...
Reinforcement learning agents interacting with a complex environment like the real world are unlikel...
We expose the danger of reward poisoning in offline multi-agent reinforcement learning (MARL), where...
This study presents a unified resilient model-free reinforcement learning (RL) based distributed con...