The core of self-supervised learning for pre-training language models includes pre-training task design as well as appropriate data augmentation. Most data augmentations in language model pre-training are context-independent. A seminal contextualized augmentation was recently proposed in ELECTRA, which achieved state-of-the-art performance by introducing an auxiliary generation network (generator) to produce contextualized data augmentation for the training of a main discrimination network (discriminator). This design, however, incurs the extra computation cost of the generator and requires tuning the relative capability of the generator and the discriminator. In this paper, we propose a self-augmentation strategy (SAS) where a single ne...
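To make the ELECTRA-style setup described above concrete, the following is a minimal PyTorch sketch (not the authors' implementation) of replaced-token detection: a small generator fills masked positions with sampled tokens, and a discriminator learns to flag which tokens were replaced. The tiny GRU encoder, toy vocabulary, batch, and hyperparameters are illustrative assumptions standing in for the actual Transformer models and pre-training data.

import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, HIDDEN, MASK_ID = 1000, 64, 0

class TinyEncoder(nn.Module):
    """Stand-in for a Transformer encoder, kept tiny for illustration."""
    def __init__(self, out_dim):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, HIDDEN)
        self.mix = nn.GRU(HIDDEN, HIDDEN, batch_first=True)  # placeholder for self-attention layers
        self.head = nn.Linear(HIDDEN, out_dim)

    def forward(self, tokens):
        hidden, _ = self.mix(self.embed(tokens))
        return self.head(hidden)

generator = TinyEncoder(out_dim=VOCAB)   # auxiliary network: predicts tokens at masked positions
discriminator = TinyEncoder(out_dim=1)   # main network: predicts whether each token was replaced
opt = torch.optim.Adam(
    list(generator.parameters()) + list(discriminator.parameters()), lr=1e-3
)

tokens = torch.randint(1, VOCAB, (8, 32))        # toy batch of token ids
mask = torch.rand(tokens.shape) < 0.15           # mask ~15% of positions
masked_tokens = tokens.masked_fill(mask, MASK_ID)

# Generator: standard masked-language-modeling loss on the masked positions.
gen_logits = generator(masked_tokens)
gen_loss = F.cross_entropy(gen_logits[mask], tokens[mask])

# Contextualized augmentation: sample plausible replacements from the generator.
# Sampling is detached so discriminator gradients never reach the generator.
with torch.no_grad():
    samples = torch.distributions.Categorical(logits=gen_logits[mask]).sample()
corrupted = tokens.clone()
corrupted[mask] = samples
is_replaced = (corrupted != tokens).float()      # labels for replaced-token detection

# Discriminator: binary replaced-token detection over every position.
disc_logits = discriminator(corrupted).squeeze(-1)
disc_loss = F.binary_cross_entropy_with_logits(disc_logits, is_replaced)

# ELECTRA up-weights the discriminator loss (lambda = 50 in the paper); the
# separate generator here is exactly the overhead that SAS aims to remove by
# letting a single network produce and consume its own augmentations.
opt.zero_grad()
(gen_loss + 50.0 * disc_loss).backward()
opt.step()

In ELECTRA the generator is trained only with the MLM loss and its sampled outputs are treated as fixed inputs to the discriminator, which is why the sampling step above is wrapped in torch.no_grad().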
Natural language processing (NLP) techniques have been significantly improved by introducing pre-trained l...
Machine-learning models can reach very high performance with supervised training, where they learn f...
Lately, the self-attention mechanism has marked a new milestone in the field of automatic s...
Deep neural models (e.g., Transformer) naturally learn spurious features, which create a "shortcut"...
Thesis (Ph.D.)--University of Washington, 2022. A robust language processing machine should be able to...
In self-supervised learning, one trains a model to solve a so-called pretext task on a dataset witho...
Masked language modeling (MLM), a self-supervised pretraining objective, is widely used in natural l...
The current era of natural language processing (NLP) has been defined by the prominence of pre-train...
Unsupervised pretraining models have been shown to facilitate a wide range of downstream application...
These improvements open many possibilities for solving downstream Natural Language Processing tasks. ...
Self-supervised pre-training of language models usually consists in predicting probability distribut...
With appropriate pre-training on unstructured text, larger and more accurate neural network models c...
The recurrent neural network language model (RNNLM) has been demonstrated to consistently reduce per...