The advent of large-scale pre-trained language models has contributed greatly to the recent progress in natural language processing. Many state-of-the-art language models are first trained on a large text corpus and then fine-tuned on downstream tasks. Despite its recent success and wide adoption, fine-tuning a pre-trained language model often suffers from overfitting, which leads to poor generalizability due to the extremely high complexity of the model and the limited training samples from downstream tasks. To address this problem, we propose a novel and effective fine-tuning framework, named Layerwise Noise Stability Regularization (LNSR). Specifically, we propose to inject the standard Gaussian noise or In-manifold noise and regularize ...
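To make the idea above concrete, here is a minimal sketch of noise-stability regularization on a toy feed-forward encoder in plain PyTorch: Gaussian noise is injected into the output of one layer, and the squared shift it causes in the subsequent layers' representations is added to the task loss. The toy encoder, the injection layer, the noise scale sigma, and the weight lambda_reg are all illustrative assumptions, not the architecture or hyperparameters from the paper.

```python
# Illustrative sketch only: noise injection at one layer plus a stability penalty
# on the layers above it. All names and values here are assumptions for the demo.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyEncoder(nn.Module):
    """Stand-in for a pre-trained encoder: a small stack of feed-forward blocks."""
    def __init__(self, dim=64, num_layers=4, num_classes=2):
        super().__init__()
        self.layers = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, dim), nn.ReLU()) for _ in range(num_layers)]
        )
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x, inject_at=None, sigma=0.0):
        hidden_states = []
        h = x
        for i, layer in enumerate(self.layers):
            h = layer(h)
            if inject_at is not None and i == inject_at:
                # Inject standard Gaussian noise into this layer's output.
                h = h + sigma * torch.randn_like(h)
            hidden_states.append(h)
        return self.head(h), hidden_states

def lnsr_loss(model, x, targets, inject_at=1, sigma=0.1, lambda_reg=1.0):
    logits_clean, hs_clean = model(x)
    _, hs_noisy = model(x, inject_at=inject_at, sigma=sigma)
    # Stability term: how much the injected noise perturbs the representations
    # of every layer above the injection point.
    stability = sum(
        (hn - hc).pow(2).mean()
        for hn, hc in zip(hs_noisy[inject_at + 1:], hs_clean[inject_at + 1:])
    )
    return F.cross_entropy(logits_clean, targets) + lambda_reg * stability

# Usage on random data, just to show the regularized objective is differentiable.
model = ToyEncoder()
x, targets = torch.randn(8, 64), torch.randint(0, 2, (8,))
loss = lnsr_loss(model, x, targets)
loss.backward()
```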
Injecting noise within gradient descent has several desirable features. In this paper, we explore no...
In this paper, we investigate the usage of large language models (LLMs) to improve the performance o...
To mitigate the problem of having to traverse over the full vocabulary in the softmax normalization ...
Adopting a two-stage paradigm of pretraining followed by fine-tuning, Pretrained Language Models (PL...
The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their gen...
While Automatic Speech Recognition (ASR) models have shown significant advances with the introductio...
Language model fine-tuning is essential for modern natural language processing, but is computational...
High-quality instruction-tuning data is critical to improving LLM capabilities. Existing data collec...
Self-supervised representation learning (SSRL) has improved the performance on downstream phoneme re...
Utilizing text-only data with an external language model (ELM) in end-to-end RNN-Transducer (RNN-T) ...
In recent research, in the domain of speech processing, large End-to-End (E2E) systems for Automatic...
Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications ...
In this work, we study the impact of Large-scale Language Models (LLMs) on Automated Speech Recogniti...
Data augmentation is a widely used technique in machine learning to improve model performance. Howev...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
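For reference, a minimal sketch of bias-only fine-tuning in the spirit of BitFit, assuming a Hugging Face BERT checkpoint: every parameter is frozen except the bias terms and the newly initialized classification head. The checkpoint name, label count, and learning rate are illustrative assumptions, not settings from the paper.

```python
# Illustrative sketch of bias-only fine-tuning; checkpoint and hyperparameters
# are placeholders, not values taken from the BitFit paper.
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Keep gradients only for bias terms and the freshly initialized classifier head.
for name, param in model.named_parameters():
    param.requires_grad = name.endswith(".bias") or name.startswith("classifier")

trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable, lr=1e-4)
print(f"trainable parameters: {sum(p.numel() for p in trainable):,}")
```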