Pretrained language models (PTLMs) are typically learned over a large, static corpus and further fine-tuned for various downstream tasks. However, when deployed in the real world, a PTLM-based model must deal with data distributions that deviate from what the PTLM was initially trained on. In this paper, we study a lifelong language model pretraining challenge where a PTLM is continually updated to adapt to emerging data. Over a domain-incremental research paper stream and a chronologically ordered tweet stream, we incrementally pretrain a PTLM with different continual learning algorithms and track the downstream task performance (after fine-tuning). We evaluate the PTLM's ability to adapt to new corpora while retaining learned k...
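The protocol this abstract describes (incrementally pretrain on a stream of corpora, then fine-tune and measure downstream performance at each step) can be illustrated with a minimal PyTorch sketch. Everything below is a hypothetical stand-in, not the paper's actual setup: the tiny model, the toy pretraining objective, and the random corpora and task data are assumptions for illustration only.

```python
import copy
import torch
from torch import nn

class TinyLM(nn.Module):
    """Stand-in for a PTLM: embeddings plus a token-prediction head."""
    def __init__(self, vocab_size=1000, dim=64):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, dim)
        self.head = nn.Linear(dim, vocab_size)

    def forward(self, tokens):                       # tokens: (batch, seq_len)
        return self.head(self.emb(tokens).mean(dim=1))

def pretrain_step(model, batch, optim):
    """One self-supervised update (toy objective: predict the first token)."""
    loss = nn.functional.cross_entropy(model(batch), batch[:, 0])
    optim.zero_grad()
    loss.backward()
    optim.step()

def finetune_and_eval(model, xs, ys):
    """Fine-tune a *copy* on the downstream task, so evaluation never
    perturbs the continually pretrained weights; return toy accuracy."""
    clone = copy.deepcopy(model)
    optim = torch.optim.Adam(clone.parameters(), lr=1e-3)
    for _ in range(20):
        loss = nn.functional.cross_entropy(clone(xs), ys)
        optim.zero_grad()
        loss.backward()
        optim.step()
    return (clone(xs).argmax(dim=-1) == ys).float().mean().item()

model = TinyLM()
optim = torch.optim.Adam(model.parameters(), lr=1e-3)
corpus_stream = [torch.randint(0, 1000, (32, 16)) for _ in range(3)]   # emerging corpora
task_x, task_y = torch.randint(0, 1000, (64, 16)), torch.randint(0, 10, (64,))

for t, corpus in enumerate(corpus_stream):
    for _ in range(20):                              # continual pretraining on corpus t
        pretrain_step(model, corpus, optim)
    acc = finetune_and_eval(model, task_x, task_y)   # track downstream performance
    print(f"after corpus {t}: downstream accuracy = {acc:.2f}")
```

The structural point the sketch captures is that fine-tuning always happens on a copy, so the continually pretrained weights are repeatedly evaluated but never overwritten by task-specific training.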
Large language models (LLMs) have demonstrated remarkable open-domain capabilities. Traditionally, L...
Pretrained language models (PLMs) are today the primary models for natural language processing. Despi...
This paper considers continual learning of a large-scale pretrained neural machine translation model w...
Large Language Models (LMs) are known to encode world knowledge in their parameters as they pretrain...
This work presents a lifelong learning approach to train a multilingual Text-To-Speech (TTS) system,...
The lifelong learning paradigm in machine learning is an attractive alternative to the more prominen...
Current pre-trained language models (PLMs) are typically trained with static data, ignoring that in r...
Large language models (LLMs) are routinely pre-trained on billions of tokens, only to restart the pr...
Continual Learning, also known as Lifelong Learning, aims to continually learn from new data as it b...
Continual learning (CL) is a setting in which a model learns from a stream of incoming data while av...
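As a concrete illustration of the CL setting described here, the sketch below implements experience replay, one standard mitigation for forgetting: a bounded buffer of past examples is mixed into each incoming batch so earlier data keeps contributing gradient signal. The toy classifier, buffer size, mixing ratio, and synthetic task stream are assumptions for illustration, not drawn from any of the papers above.

```python
import random
import torch
from torch import nn

model = nn.Linear(8, 2)                      # toy classifier standing in for the learner
optim = torch.optim.SGD(model.parameters(), lr=0.1)
replay_buffer, BUFFER_CAP = [], 256          # bounded store of past (x, y) examples

def observe(x, y):
    """One CL update: mix replayed past examples into the incoming batch."""
    bx, by = x, y
    if replay_buffer:
        px, py = zip(*random.sample(replay_buffer, min(8, len(replay_buffer))))
        bx = torch.cat([bx, torch.stack(px)])
        by = torch.cat([by, torch.stack(py)])
    loss = nn.functional.cross_entropy(model(bx), by)
    optim.zero_grad()
    loss.backward()
    optim.step()
    for xi, yi in zip(x, y):                 # store the *new* examples, up to capacity
        if len(replay_buffer) < BUFFER_CAP:
            replay_buffer.append((xi, yi))

# A stream of two "tasks" whose input distribution shifts over time.
for shift in (0.0, 3.0):
    for _ in range(50):
        x = torch.randn(16, 8) + shift
        y = (x.sum(dim=1) > shift * 8).long()
        observe(x, y)
```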
Keeping the performance of language technologies optimal as time passes is of great practical intere...
Pretrained language models have become the standard approach for many NLP tasks due to strong perfor...
Recent work on large language models relies on the intuition that most natural language processing t...
Pre-trained models are nowadays a fundamental component of machine learning research. In continual l...
Recent advances in NLP have been driven by a range of large-scale pretrained language models (PLMs). Thes...