A line of work has shown that natural language processing models are vulnerable to adversarial examples. Correspondingly, various defense methods have been proposed to mitigate the threat of textual adversarial examples, e.g., adversarial training, input transformations, and detection. In this work, we treat the optimization process of synonym substitution-based textual adversarial attacks as a specific sequence of word replacements, in which each word mutually influences the others. We observe that such mutual interaction can be broken, and the adversarial perturbation eliminated, by randomly substituting a word with its synonyms. Based on this observation, we propose a novel textual adversarial example detection method, termed Randomized Substitut...
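The abstract above is cut off before the full procedure, so the following is only a minimal sketch of how detection via randomized synonym substitution could look. Everything concrete here is an assumption not stated in the abstract: synonyms are drawn from WordNet via NLTK, predict_label stands in for any user-supplied classifier, and an input is flagged as adversarial when randomized copies frequently flip the original prediction; the paper's actual decision rule may differ.

# Sketch: detection by randomized synonym substitution (assumed details).
import random
from nltk.corpus import wordnet


def synonyms(word):
    """Collect single-word WordNet synonyms for `word`, excluding the word itself."""
    candidates = {
        lemma.name().replace("_", " ")
        for synset in wordnet.synsets(word)
        for lemma in synset.lemmas()
    }
    candidates.discard(word)
    return [c for c in candidates if " " not in c]


def randomized_substitution(tokens, sub_rate=0.3, rng=random):
    """Randomly replace a fraction of tokens with one of their synonyms."""
    out = list(tokens)
    for i, tok in enumerate(out):
        if rng.random() < sub_rate:
            options = synonyms(tok)
            if options:
                out[i] = rng.choice(options)
    return out


def detect_adversarial(text, predict_label, n_votes=10, flip_threshold=0.5):
    """Flag `text` as adversarial if randomized copies often change the predicted label."""
    tokens = text.split()
    original = predict_label(" ".join(tokens))
    flips = 0
    for _ in range(n_votes):
        randomized = randomized_substitution(tokens)
        if predict_label(" ".join(randomized)) != original:
            flips += 1
    return flips / n_votes >= flip_threshold

Here sub_rate, n_votes, and flip_threshold are illustrative hyperparameters; the intuition from the abstract is that random synonym substitution disrupts the carefully coordinated word replacements of an attack, so adversarial inputs change label under substitution far more often than clean inputs do.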
Neural ranking models (NRMs) have shown remarkable success in recent years, especially with pre-trai...
For humans, distinguishing machine generated text from human written text is mentally taxing and s...
Text classification is a basic task in natural language processing, but the small character perturba...
Research shows that natural language processing models are generally vulnerable to ...
Modern text classification models are susceptible to adversarial examples, perturbed versions of the...
Adversarial attacks in NLP challenge the way we look at language models. The goal of this kind of ad...
Deep learning models have excelled in solving many problems in Natural Language Processing, but are ...
In recent years, neural networks have been widely used in image processing, natural language processin...
With the advent of high-performance computing devices, deep neural networks have gained a lot of pop...
We present three large-scale experiments on a binary text matching classification task, both in Chinese...
Adversarial training is the most empirically successful approach in improving the robustness of deep...
NLP researchers have proposed various word-substitute black-box attacks that can fool text classificatio...
Neural language models show vulnerability to adversarial examples which are semantically similar to ...
Despite deep neural networks (DNNs) having achieved impressive performance in various domains, it ha...
Adversarial examples in NLP are receiving increasing research attention. One line of investigation i...