Pre-training a language model and then fine-tuning it on downstream tasks has produced state-of-the-art results across a wide range of NLP tasks. However, pre-training is usually independent of the downstream task, and previous work has shown that generic pre-training alone may not capture task-specific nuances. We propose a way to tailor a pre-trained BERT model to the downstream task via task-specific masking before the standard supervised fine-tuning. First, a small word list specific to the task is collected; for example, for sentiment classification we collect a small sample of words expressing positive and negative sentiment. Next, each word's importance for the task, called the word's task score, is measur...
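The truncated abstract leaves the exact scoring and masking procedure unspecified, so the following Python sketch only illustrates the general recipe under stated assumptions: a small seed word list stands in for the collected task words, task_score is a toy importance measure, and task_specific_mask corrupts task-relevant tokens for an extra masked-LM pass before supervised fine-tuning. All names here (SEED_WORDS, task_score, task_specific_mask) are hypothetical.

```python
import random

# Hypothetical seed list for a sentiment task; the paper's actual word list
# and task-score definition are not visible in the truncated abstract.
SEED_WORDS = {"good", "great", "excellent", "bad", "terrible", "awful"}

def task_score(word):
    """Toy task score: 1.0 for seed words, 0.0 otherwise. A real score
    would presumably be continuous (e.g. embedding similarity to the
    seed words); that detail is an assumption here."""
    return 1.0 if word.lower() in SEED_WORDS else 0.0

def task_specific_mask(tokens, threshold=0.5, base_rate=0.15,
                       mask_token="[MASK]", seed=0):
    """Mask task-relevant tokens (plus a small random fraction, as in
    standard MLM) so an extra masked-LM pass before supervised
    fine-tuning focuses on words that matter for the task."""
    rng = random.Random(seed)
    corrupted, targets = [], []
    for tok in tokens:
        if task_score(tok) > threshold or rng.random() < base_rate:
            corrupted.append(mask_token)
            targets.append(tok)      # token to be predicted
        else:
            corrupted.append(tok)
            targets.append(None)     # ignored by the MLM loss
    return corrupted, targets

print(task_specific_mask("the film was great but the ending was terrible".split()))
```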
Pre-trained Language Models are widely used in many important real-world applications. However, rece...
Pretrained Masked Language Models (MLMs) have revolutionised NLP in recent years. However, previous ...
Word order, an essential property of natural languages, is injected in Transformer-based neural lang...
Masked language modeling (MLM), a self-supervised pretraining objective, is widely used in natural l...
The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their gen...
Masked language models conventionally use a masking rate of 15% due to the belief that more masking ...
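To make the notion of a masking rate concrete, here is a minimal sketch of random token corruption with a configurable rate, where 0.15 reflects the conventional default mentioned above. The helper mask_tokens is illustrative only, not code from any of the papers, and it omits BERT's 80/10/10 replacement scheme.

```python
import random

def mask_tokens(tokens, mask_rate=0.15, mask_token="[MASK]", seed=None):
    """Randomly pick a fraction of positions (the masking rate) and replace
    them with the mask token; the model is trained to recover the original
    tokens at exactly those positions."""
    rng = random.Random(seed)
    n_mask = max(1, round(mask_rate * len(tokens)))
    positions = set(rng.sample(range(len(tokens)), n_mask))
    corrupted = [mask_token if i in positions else t for i, t in enumerate(tokens)]
    targets = {i: tokens[i] for i in positions}
    return corrupted, targets

tokens = "masked language models learn by predicting hidden words".split()
print(mask_tokens(tokens, mask_rate=0.15, seed=0))
```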
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
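A minimal PyTorch sketch of the bias-only idea, assuming a HuggingFace-style BERT classifier: every parameter whose name does not contain "bias" is frozen. Keeping the newly initialised classification head trainable is an assumption here, since the snippet is cut off before the details.

```python
import torch
from transformers import AutoModelForSequenceClassification

# Bias-only ("sparse") fine-tuning sketch: only bias parameters (and, as an
# assumption, the fresh task head) receive gradient updates.
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

trainable = []
for name, param in model.named_parameters():
    if "bias" in name or name.startswith("classifier"):
        param.requires_grad = True
        trainable.append(name)
    else:
        param.requires_grad = False

optimizer = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=1e-3
)
print(f"{len(trainable)} trainable parameter tensors (biases + head)")
```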
Pre-trained language models (PTMs) have been shown to yield powerful text representations for dense pas...
The current era of natural language processing (NLP) has been defined by the prominence of pre-train...
Recently, the development of pre-trained language models has brought natural language processing (NL...
Though achieving impressive results on many NLP tasks, BERT-like masked language models (MLMs) en...
A fundamental challenge of over-parameterized deep learning models is learning meaningful data repre...
Self-supervised learning via masked prediction pre-training (MPPT) has shown impressive performance ...
Masked Language Modeling (MLM) has proven to be an essential component of Vision-Language (VL) pretr...