Recent studies show that pre-trained language models (LMs) are vulnerable to textual adversarial attacks. However, existing attack methods either suffer from low attack success rates or fail to search efficiently in the exponentially large perturbation space. We propose SemAttack, an efficient and effective framework that generates natural adversarial text by constructing different semantic perturbation functions. In particular, SemAttack optimizes the generated perturbations constrained on generic semantic spaces, including the typo space, the knowledge space (e.g., WordNet), the contextualized semantic space (e.g., the embedding space of BERT clusterings), or a combination of these spaces. Thus, the generated adversarial texts are more semantically clo...
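As a rough illustration of the "knowledge space" mentioned in the abstract above, the sketch below builds a WordNet-based substitution candidate set with NLTK. This is not the SemAttack implementation; the function name is hypothetical and the example only shows one way such a perturbation space could be constructed.

```python
# Minimal, illustrative sketch: collect WordNet synonyms as a knowledge-space
# perturbation set for a single word. Requires: nltk.download('wordnet').
# Hypothetical helper name; not the authors' code.
from nltk.corpus import wordnet


def knowledge_space_candidates(word: str) -> set[str]:
    """Return single-token WordNet synonyms usable as substitution candidates."""
    candidates = set()
    for synset in wordnet.synsets(word):
        for lemma in synset.lemmas():
            name = lemma.name().replace("_", " ")
            # keep only single-word synonyms that differ from the input
            if " " not in name and name.lower() != word.lower():
                candidates.add(name)
    return candidates


# Example: candidate perturbations for one word
print(knowledge_space_candidates("movie"))  # e.g. {'film', 'picture', ...}
```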
Hard-label textual adversarial attack is a challenging task, as only the predicted label information...
Neural language models show vulnerability to adversarial examples which are semantically similar to ...
NLP researchers propose different word-substitute black-box attacks that can fool text classificatio...
Machine learning algorithms are often vulnerable to adversarial examples that have imperceptible alt...
Recent studies have shown that natural language processing (NLP) models are vulnerable to adversaria...
Adversarial attacks in NLP challenge the way we look at language models. The goal of this kind of ad...
We study an important and challenging task of attacking natural language processing models in a hard...
Semantic parsing is a technique aimed at constructing a structured representation of the meaning of ...
We study an important task of attacking natural language processing models in a black box setting. W...
The monumental achievements of deep learning (DL) systems seem to guarantee the absolute superiority...
Named Entity Recognition is a fundamental task in information extraction and is an essential element...
We attribute the vulnerability of natural language processing models to the fact that similar inputs...
Modern text classification models are susceptible to adversarial examples, perturbed versions of the...
Large language models (LLMs) are susceptible to red teaming attacks, which can induce LLMs to genera...
Despite their promising performance across various natural language processing (NLP) tasks, current ...