The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their generalization problem: performance drops sharply when they are evaluated on examples that differ from the training dataset, known as Out-of-Distribution (OOD) or unseen examples. This limitation arises from PLMs' reliance on spurious correlations, which work well for frequent example types but not for general examples. To address this issue, we propose a training approach called Mask-tuning, which integrates Masked Language Modeling (MLM) training objectives into the fine-tuning process to enhance PLMs' generalization. Comprehensive experiments demonstrate that Mask-tuning surpasses current state-of-the-art techniques and enhances PLMs'...
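Below is a minimal sketch of the kind of joint objective this abstract describes: an MLM loss on randomly masked fine-tuning inputs combined with the downstream task loss on the same batch. The masking rate, the `alpha` weighting, the binary task head, and the use of the [CLS] representation of the masked input are illustrative assumptions, not the paper's exact Mask-tuning recipe.

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
mlm_model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")
# Downstream task head on top of the shared encoder (binary task assumed for illustration).
classifier = torch.nn.Linear(mlm_model.config.hidden_size, 2)

def mask_tokens(input_ids, mask_prob=0.15):
    """Randomly mask a fraction of tokens; unmasked positions are ignored by the MLM loss."""
    special = torch.tensor([tokenizer.cls_token_id, tokenizer.sep_token_id, tokenizer.pad_token_id])
    mask = (torch.rand(input_ids.shape) < mask_prob) & ~torch.isin(input_ids, special)
    labels = input_ids.clone()
    labels[~mask] = -100                              # -100 = ignored by the MLM cross-entropy
    masked_ids = input_ids.clone()
    masked_ids[mask] = tokenizer.mask_token_id
    return masked_ids, labels

def joint_loss(texts, task_labels, alpha=0.5):
    """Weighted sum of the MLM loss and the downstream fine-tuning loss on the same batch."""
    enc = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    masked_ids, mlm_labels = mask_tokens(enc["input_ids"])
    out = mlm_model(input_ids=masked_ids,
                    attention_mask=enc["attention_mask"],
                    labels=mlm_labels,
                    output_hidden_states=True)
    cls_repr = out.hidden_states[-1][:, 0]            # [CLS] vector of the masked input
    task_loss = torch.nn.functional.cross_entropy(classifier(cls_repr), task_labels)
    return alpha * out.loss + (1 - alpha) * task_loss  # the weighting scheme is an assumption
```

For example, `joint_loss(["great movie", "terrible plot"], torch.tensor([1, 0]))` returns a single scalar that back-propagates through both the MLM head and the task head of the shared encoder.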
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
Pre-trained language models (PLMs) have demonstrated impressive performance across various downstrea...
A fundamental challenge of over-parameterized deep learning models is learning meaningful data repre...
Adopting a two-stage paradigm of pretraining followed by fine-tuning, Pretrained Language Models (PL...
Pre-training a language model and then fine-tuning it for downstream tasks has demonstrated state-of...
Masked language models conventionally use a masking rate of 15% due to the belief that more masking ...
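As a concrete reference point for the conventional 15% rate mentioned above, the sketch below uses Hugging Face's `DataCollatorForLanguageModeling` (an assumed setup, not this paper's code) to show where that rate is configured and how it would be raised for a higher-masking experiment.

```python
from transformers import AutoTokenizer, DataCollatorForLanguageModeling

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer,
    mlm=True,
    mlm_probability=0.15,  # the conventional rate; e.g. 0.40 would mask far more aggressively
)

# The collator masks tokens on the fly and sets labels to -100 at unmasked positions.
batch = collator([tokenizer("a sample sentence for masking")])
print(batch["input_ids"], batch["labels"])
```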
Language model fine-tuning is essential for modern natural language processing, but is computational...
In this paper, we move towards combining large parametric models with non-parametric prototypical ne...
Recent advances in NLP are brought by a range of large-scale pretrained language models (PLMs). Thes...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
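A minimal sketch of bias-only fine-tuning in the spirit of BitFit: freeze every pre-trained weight and leave only the bias vectors (plus, here, the task head) trainable. The model choice, learning rate, and inclusion of the classifier head are illustrative assumptions.

```python
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

for name, param in model.named_parameters():
    # Train only bias terms; the task head is also unfrozen here so the classifier can adapt.
    param.requires_grad = name.endswith(".bias") or name.startswith("classifier")

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable params: {trainable} / {total} ({100 * trainable / total:.2f}%)")

optimizer = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=1e-3
)
```

Because only a small fraction of parameters receives gradients, optimizer state and per-task checkpoints shrink accordingly, which is the practical appeal of this family of sparse-finetuning methods.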
High-quality instruction-tuning data is critical to improving LLM capabilities. Existing data collec...
The advent of large-scale pre-trained language models has contributed greatly to the recent progress...
Pretrained language models (PTLMs) are typically learned over a large, static corpus and further fin...
Language models, given their black-box nature, often exhibit sensitivity to input perturbations, lea...
The core of self-supervised learning for pre-training language models includes pre-training task des...