We expose the danger of reward poisoning in offline multi-agent reinforcement learning (MARL), whereby an attacker can modify the reward vectors to different learners in an offline data set while incurring a poisoning cost. Based on the poisoned data set, all rational learners using some confidence-bound-based MARL algorithm will infer that a target policy - chosen by the attacker and not necessarily a solution concept originally - is the Markov perfect dominant strategy equilibrium for the underlying Markov Game, hence they will adopt this potentially damaging target policy in the future. We characterize the exact conditions under which the attacker can install a target policy. We further show how the attacker can formulate a linear progra...
Many state-of-the-art cooperative multi-agent reinforcement learning (MARL) approaches, such as MADD...
Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease a victim'...
Reinforcement learning (RL) has advanced greatly in the past few years with the employment of effect...
In offline multi-agent reinforcement learning (MARL), agents estimate policies from a given dataset....
We study black-box reward poisoning attacks against reinforcement learning (RL), in which an adversa...
We study defense strategies against reward poisoning attacks in reinforcement learning. As a threat ...
In recent years, a proliferation of methods were developed for cooperative multi-agent reinforcement...
This Ph.D. dissertation studies the control of multi-agent reinforcement learning (MARL) and multi-a...
Reinforcement Learning (RL) is a promising framework for solving sequential decision making problems...
In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversar...
Deep Learning methods are known to be vulnerable to adversarial attacks. Since Deep Reinforcement Le...
Communication is important in many multi-agent reinforcement learning (MARL) problems for agents to ...
We posit a new mechanism for cooperation in multi-agent reinforcement learning (MARL) based upon an...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...
Adversarial attacks in reinforcement learning (RL) often assume highly-privileged access to the vict...
Many state-of-the-art cooperative multi-agent reinforcement learning (MARL) approaches, such as MADD...
Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease a victim'...
Reinforcement learning (RL) has advanced greatly in the past few years with the employment of effect...
In offline multi-agent reinforcement learning (MARL), agents estimate policies from a given dataset....
We study black-box reward poisoning attacks against reinforcement learning (RL), in which an adversa...
We study defense strategies against reward poisoning attacks in reinforcement learning. As a threat ...
In recent years, a proliferation of methods were developed for cooperative multi-agent reinforcement...
This Ph.D. dissertation studies the control of multi-agent reinforcement learning (MARL) and multi-a...
Reinforcement Learning (RL) is a promising framework for solving sequential decision making problems...
In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversar...
Deep Learning methods are known to be vulnerable to adversarial attacks. Since Deep Reinforcement Le...
Communication is important in many multi-agent reinforcement learning (MARL) problems for agents to ...
We posit a new mechanism for cooperation in multi-agent reinforcement learning (MARL) based upon an...
We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy...
Adversarial attacks in reinforcement learning (RL) often assume highly-privileged access to the vict...
Many state-of-the-art cooperative multi-agent reinforcement learning (MARL) approaches, such as MADD...
Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease a victim'...
Reinforcement learning (RL) has advanced greatly in the past few years with the employment of effect...