Language model fine-tuning is essential for modern natural language processing, but it is computationally expensive and time-consuming. Further, the effectiveness of fine-tuning is limited by the inclusion of training examples that negatively affect performance. Here we present information gain filtration, a general method for improving both the training efficiency and the final performance of language model fine-tuning. We define the information gain of an example as the improvement on a test metric after training on that example. A secondary learner is then trained to approximate this quantity. During fine-tuning, this learner selects informative examples and skips uninformative ones. We show that our method has consis...
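To make the procedure concrete, the following is a minimal, self-contained sketch of information gain filtration on a toy regression problem. The toy model, the ridge-regression secondary learner, the per-example features, and all hyperparameters are illustrative assumptions, not the setup used in the work described above.

```python
# Toy sketch of information gain filtration (IGF). The "language model" is a
# linear regressor and the secondary learner is ridge regression; both are
# stand-ins chosen only to keep the example runnable, not the original setup.
import copy
import torch
from sklearn.linear_model import Ridge

torch.manual_seed(0)
w_true = torch.randn(8)

def make_example(noisy):
    # Noisy examples stand in for training data that hurts the objective metric.
    x = torch.randn(8)
    y = x @ w_true + (3.0 if noisy else 0.05) * torch.randn(())
    return x, y

corpus = [make_example(noisy=(i % 2 == 0)) for i in range(400)]
objective_set = [make_example(noisy=False) for _ in range(200)]

def heldout_loss(model, batch):
    # Test metric: mean squared error on the held-out objective set.
    with torch.no_grad():
        return torch.stack([(model(x).squeeze() - y) ** 2 for x, y in batch]).mean().item()

def sgd_step(model, example, lr=0.01):
    # One plain SGD update on a single example.
    x, y = example
    loss = (model(x).squeeze() - y) ** 2
    model.zero_grad()
    loss.backward()
    with torch.no_grad():
        for p in model.parameters():
            p -= lr * p.grad

def information_gain(model, example):
    # IG of an example: how much one update on it improves the test metric.
    trial = copy.deepcopy(model)
    before = heldout_loss(trial, objective_set)
    sgd_step(trial, example)
    return before - heldout_loss(trial, objective_set)

def features(example):
    # Cheap per-example features for the secondary learner (an assumption here).
    x, y = example
    return torch.cat([x, y.reshape(1)]).numpy()

# Train the secondary learner to predict IG from example features.
base = torch.nn.Linear(8, 1)
probe = corpus[:100]
secondary = Ridge().fit([features(ex) for ex in probe],
                        [information_gain(base, ex) for ex in probe])

# Fine-tune, training only on examples predicted to be informative.
model = torch.nn.Linear(8, 1)
for ex in corpus[100:]:
    if secondary.predict([features(ex)])[0] > 0.0:
        sgd_step(model, ex)
print("objective-set loss after filtered fine-tuning:", heldout_loss(model, objective_set))
```

The key design choice this sketch illustrates is that the expensive quantity (information gain, which requires a trial update and a held-out evaluation) is only measured on a small probe set; afterwards, the cheap secondary learner decides per example whether a real update is worth taking.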
Language models, given their black-box nature, often exhibit sensitivity to input perturbations, lea...
The pre-training and fine-tuning paradigm has contributed to a number of breakthroughs in Natural La...
Gigantic pre-trained models have become central to natural language processing (NLP), serving as the...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
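As a rough illustration of the bias-only idea, fine-tuning can be restricted to bias terms by freezing every other parameter. This is a sketch assuming a Hugging Face transformers BERT checkpoint and a classification head; the checkpoint name and optimizer settings are not taken from the paper.

```python
# Sketch of bias-only fine-tuning in the spirit of BitFit, using a Hugging Face
# transformers model. The checkpoint name, task head, and optimizer settings are
# illustrative assumptions, not prescriptions from the paper.
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

# Freeze everything except bias terms and the newly initialized task head.
for name, param in model.named_parameters():
    param.requires_grad = name.endswith(".bias") or name.startswith("classifier.")

trainable = [p for p in model.parameters() if p.requires_grad]
print(f"trainable parameters: {sum(p.numel() for p in trainable):,} "
      f"of {sum(p.numel() for p in model.parameters()):,}")

optimizer = torch.optim.AdamW(trainable, lr=1e-4)
# ...the usual fine-tuning loop then updates only these parameters.
```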
Adopting a two-stage paradigm of pretraining followed by fine-tuning, Pretrained Language Models (PL...
The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their gen...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
In recent years, language models (LMs) have made remarkable progress in advancing the field of natu...
Fine-tuning BERT-based models is resource-intensive in memory, computation, and time. While many pri...
Finetuning can be used to tackle domain-specific tasks by transferring knowledge learned from pre-tr...
The dominant approaches for controlling language models achieve prominence in controlling high-level...
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
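For readers unfamiliar with the design, a sparse MoE block routes each token to a small subset of expert sub-networks, so parameters can be added without a proportional increase in per-token compute. Below is a minimal sketch with top-1 routing; the layer sizes, the routing rule, and the absence of load-balancing losses are simplifying assumptions made to keep the illustration short.

```python
# Sketch of a sparse Mixture-of-Experts (MoE) block with top-1 routing.
import torch
import torch.nn as nn

class SparseMoE(nn.Module):
    def __init__(self, d_model=64, n_experts=4):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)      # learned gating
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts))

    def forward(self, x):                                 # x: (tokens, d_model)
        gates = self.router(x).softmax(dim=-1)            # routing probabilities
        weight, idx = gates.max(dim=-1)                   # top-1 expert per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = idx == e                               # tokens sent to expert e
            if mask.any():
                out[mask] = weight[mask].unsqueeze(1) * expert(x[mask])
        return out

tokens = torch.randn(10, 64)
print(SparseMoE()(tokens).shape)                          # torch.Size([10, 64])
```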
In this paper, we move towards combining large parametric models with non-parametric prototypical ne...
Widely used language models (LMs) are typically built by scaling up a two-stage training pipeline: a...
Neural language models often fail to generate diverse and informative texts, limiting their applicab...