Pre-trained Language Models (PLMs) have achieved great success in various Natural Language Processing (NLP) tasks under the pre-training and fine-tuning paradigm. With their large numbers of parameters, PLMs are computation-intensive and resource-hungry. Hence, model pruning has been introduced to compress large-scale PLMs. However, most prior approaches only consider task-specific knowledge for downstream tasks but ignore the essential task-agnostic knowledge during pruning, which may cause catastrophic forgetting and lead to poor generalization ability. To maintain both task-agnostic and task-specific knowledge in our pruned model, we propose ContrAstive Pruning (CAP) under the paradigm of pre-training and fine-tuning. I...
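To make the idea concrete, below is a minimal sketch (not the authors' released implementation) of the kind of objective the CAP abstract describes: while pruning, the pruned model's representation is pulled toward both the original pre-trained model (task-agnostic knowledge) and the fine-tuned model (task-specific knowledge) with an InfoNCE-style contrastive loss, on top of the ordinary downstream task loss. The tensor shapes, temperature, and weighting coefficients are illustrative assumptions, not values taken from the paper.

```python
# Hedged sketch of a CAP-style training objective: task loss plus two
# contrastive terms that keep the pruned model close to a pre-trained and
# a fine-tuned teacher. Shapes and hyperparameters are illustrative.
import torch
import torch.nn.functional as F


def contrastive_loss(anchor, positive, temperature=0.1):
    """InfoNCE loss: row i of `anchor` should match row i of `positive`
    and be pushed away from the other rows (in-batch negatives)."""
    anchor = F.normalize(anchor, dim=-1)
    positive = F.normalize(positive, dim=-1)
    logits = anchor @ positive.t() / temperature          # (B, B) similarities
    targets = torch.arange(anchor.size(0), device=anchor.device)
    return F.cross_entropy(logits, targets)


def cap_style_loss(pruned_repr, pretrained_repr, finetuned_repr, task_loss,
                   lambda_pre=1.0, lambda_fine=1.0):
    """Combine the downstream task loss with two contrastive terms so the
    pruned model retains both task-agnostic and task-specific knowledge.
    Teacher representations are detached so only the pruned model updates."""
    return (task_loss
            + lambda_pre * contrastive_loss(pruned_repr, pretrained_repr.detach())
            + lambda_fine * contrastive_loss(pruned_repr, finetuned_repr.detach()))


if __name__ == "__main__":
    B, H = 8, 768                      # batch size and hidden size (illustrative)
    pruned = torch.randn(B, H, requires_grad=True)
    pre, fine = torch.randn(B, H), torch.randn(B, H)
    loss = cap_style_loss(pruned, pre, fine, task_loss=torch.tensor(0.5))
    loss.backward()                    # gradients flow only into the pruned model
    print(float(loss))
```

In practice the three representations would come from the same input batch passed through the pruned model and the two frozen teachers; the demo above uses random tensors only to show that the objective is differentiable with respect to the pruned model alone.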
Deep networks are typically trained with many more parameters than the size of the training dataset....
Large, pre-trained models are problematic to use in resource constrained applications. Fortunately, ...
Deep neural networks often have millions of parameters. This can hinder their deployment to low-end ...
Large Language Models have become the core architecture upon which most modern natural language proc...
Model compression by way of parameter pruning, quantization, or distillation has recently gained pop...
The pre-training and fine-tuning paradigm has contributed to a number of breakthroughs in Natural La...
Transformer-based language models have become a key building block for natural language processing. ...
With the dramatically increased number of parameters in language models, sparsity methods have recei...
We study the impact of different pruning techniques on the representation learned by deep neural net...
Recent work has focused on compressing pre-trained language models (PLMs) like BERT where the major ...
The growing size of neural language models has led to increased attention in model compression. The ...
Modern large-scale Pre-trained Language Models (PLMs) have achieved tremendous success on a wide ran...
The growing energy and performance costs of deep learning have driven the community to reduce the si...
Sparsity has become one of the promising methods to compress and accelerate Deep Neural Networks (DN...
Gigantic pre-trained models have become central to natural language processing (NLP), serving as the...