Large pre-trained language models have recently gained significant traction due to their improved performance on various downstream tasks like text classification and question answering, requiring only a few epochs of fine-tuning. However, their large model sizes often prohibit their application on resource-constrained edge devices. Existing solutions for yielding parameter-efficient BERT models largely rely on compute-exhaustive training and fine-tuning. Moreover, they often rely on additional compute-heavy models to mitigate the performance gap. In this paper, we present Sensi-BERT, a sensitivity-driven efficient fine-tuning of BERT models that can take an off-the-shelf pre-trained BERT model and yield highly parameter-efficient models for...
Limited computational budgets often prevent transformers from being used in production and from havi...
Fine-tuning pre-trained models has achieved impressive performance on standard natural language pro...
There is growing interest in adapting large-scale language models using parameter-efficient fine-t...
Fine-tuning BERT-based models is resource-intensive in memory, computation, and time. While many pri...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
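To make the BitFit recipe above concrete, here is a minimal sketch of bias-only fine-tuning, assuming the Hugging Face transformers library and a bert-base-uncased sequence-classification model; the checkpoint name, label count, and learning rate are illustrative choices, not taken from the paper.

```python
# Minimal sketch of BitFit-style bias-only fine-tuning (illustrative only).
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Freeze all pre-trained weights except the bias terms; the randomly
# initialized task head ("classifier") is also left trainable, since it
# has no pre-trained values to preserve.
for name, param in model.named_parameters():
    param.requires_grad = name.endswith(".bias") or name.startswith("classifier")

# Hand only the trainable (bias + head) parameters to the optimizer,
# a tiny fraction of BERT-base's total parameter count.
optimizer = torch.optim.AdamW(
    [p for p in model.parameters() if p.requires_grad], lr=1e-4
)
# A standard training loop then updates only these parameters.
```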
As language models have grown in parameters and layers, it has become much harder to train and infer...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
Transformer-based language models have become a key building block for natural language processing. ...
Gigantic pre-trained models have become central to natural language processing (NLP), serving as the...
Transformer-based architectures have become the de facto models used for a range of Natural Language Pro...
Language model fine-tuning is essential for modern natural language processing, but is computational...
Recently, the development of pre-trained language models has brought natural language processing (NL...
In this paper, we move towards combining large parametric models with non-parametric prototypical ne...
Pre-training a language model and then fine-tuning it for downstream tasks has demonstrated state-of...