Current pre-trained language models (PLMs) are typically trained with static data, ignoring that in real-world scenarios, streaming data of various sources may continuously grow. This requires PLMs to integrate the information from all the sources in a lifelong manner. Although this goal could be achieved by exhaustive pre-training on all the existing data, such a process is known to be computationally expensive. To this end, we propose ELLE, aiming at efficient lifelong pre-training for emerging data. Specifically, ELLE consists of (1) function preserved model expansion, which flexibly expands an existing PLM's width and depth to improve the efficiency of knowledge acquisition; and (2) pre-trained domain prompts, which disentangle the versatile knowledge learned during pre-training and stimulate the proper knowledge for downstream tasks.
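The "function preserved model expansion" in (1) is in the spirit of Net2Net: a smaller PLM is grown in width and depth so that the enlarged model initially computes the same function as the original, and pre-training then continues from that initialization. The sketch below illustrates the width-expansion half of that idea for a single pair of linear layers; it is a minimal PyTorch illustration under that assumption, not ELLE's actual code, and the helper name `widen_pair` is hypothetical.

```python
import torch
import torch.nn as nn

def widen_pair(fc1: nn.Linear, fc2: nn.Linear, new_hidden: int):
    """Net2Net-style width expansion: grow the hidden dimension shared by
    fc1 (producer) and fc2 (consumer) from fc1.out_features to new_hidden
    while preserving the function x -> fc2(act(fc1(x))) for any
    element-wise activation act."""
    old_hidden = fc1.out_features
    assert new_hidden > old_hidden
    # Each new hidden unit is a copy of a randomly chosen existing unit.
    mapping = torch.cat([
        torch.arange(old_hidden),
        torch.randint(0, old_hidden, (new_hidden - old_hidden,)),
    ])
    # How many copies (including the original) each old unit now has.
    counts = torch.bincount(mapping, minlength=old_hidden).float()

    wide_fc1 = nn.Linear(fc1.in_features, new_hidden)
    wide_fc2 = nn.Linear(new_hidden, fc2.out_features)
    with torch.no_grad():
        # Duplicate the incoming weights of copied units, so the widened
        # hidden activation is the old one with repeated entries.
        wide_fc1.weight.copy_(fc1.weight[mapping])
        wide_fc1.bias.copy_(fc1.bias[mapping])
        # Duplicate the outgoing weights and divide by the copy count,
        # so the replicas jointly contribute what the original unit did.
        wide_fc2.weight.copy_(fc2.weight[:, mapping] / counts[mapping])
        wide_fc2.bias.copy_(fc2.bias)
    return wide_fc1, wide_fc2

# Sanity check: the widened pair computes the same function.
fc1, fc2 = nn.Linear(128, 256), nn.Linear(256, 128)
wide_fc1, wide_fc2 = widen_pair(fc1, fc2, new_hidden=512)
x = torch.randn(4, 128)
assert torch.allclose(fc2(torch.relu(fc1(x))),
                      wide_fc2(torch.relu(wide_fc1(x))), atol=1e-5)
```

Because the copies of a unit share identical incoming weights, their outgoing weights are rescaled by the copy count so that the sum over replicas reproduces the original unit's contribution; depth expansion can be made function-preserving analogously, e.g. by initializing new layers near the identity.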
Despite achieving state-of-the-art performance on many NLP tasks, the high energy cost and long infe...
Machine learning (ML) pipelines for model training and validation typically include preprocessing, s...
Existing pre-trained models are generally geared towards a particular class of problems. To date, th...
Pretrained language models (PTLMs) are typically learned over a large, static corpus and further fin...
Transfer learning applies knowledge or patterns learned in a particular field or task to differe...
Pre-training language models (LMs) on large-scale unlabeled text data makes them much easier to...
The task of data-to-text generation amounts to describing structured data, such as RDF triples, in f...
Pretrained language models (PLMs) have demonstrated remarkable performance in various natural langua...
Large language models (LLMs) have demonstrated remarkable open-domain capabilities. Traditionally, L...
The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their gen...
Natural language processing (NLP) techniques have been significantly improved by introducing pre-trained l...
Recent advances in NLP have been brought by a range of large-scale pretrained language models (PLMs). Thes...
Thesis (Ph.D.)--University of Washington, 2022. A robust language processing machine should be able to...
Despite achieving state-of-the-art performance on many NLP tasks, the high energy cost and long infe...
Since the first bidirectional deep learning model for natural language understanding, BERT, emerge...