In this paper, we take advantage of previous pre-trained models (PTMs) and propose a novel Chinese Pre-trained Unbalanced Transformer (CPT). Different from previous Chinese PTMs, CPT is designed to utilize the shared knowledge between natural language understanding (NLU) and natural language generation (NLG) to boost performance. CPT consists of three parts: a shared encoder, an understanding decoder, and a generation decoder. Two specific decoders with a shared encoder are pre-trained with masked language modeling (MLM) and denoising auto-encoding (DAE) tasks, respectively. With the partially shared architecture and multi-task pre-training, CPT can (1) learn specific knowledge of both NLU and NLG tasks with two decoders and (2) be f...
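To make the partially shared layout concrete, here is a minimal, hypothetical sketch of the shared-encoder / two-decoder structure the abstract describes; it is not the authors' released implementation. The class name CPTSketch, the layer counts, dimensions, and the BERT-style vocabulary size of 21128 are all assumptions chosen for illustration; only the split into a shared encoder, a bidirectional understanding decoder with an MLM head, and an autoregressive generation decoder trained with DAE follows the text.

```python
# Hypothetical sketch of the CPT layout (shared encoder, two decoders).
# All sizes and names are illustrative assumptions, not the released model.
import torch
import torch.nn as nn

class CPTSketch(nn.Module):
    def __init__(self, vocab_size=21128, d_model=768, nhead=12,
                 enc_layers=10, dec_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # Shared encoder used by both the NLU and NLG paths.
        self.encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead, batch_first=True),
            num_layers=enc_layers)
        # Understanding decoder: extra bidirectional layers plus an MLM head.
        self.u_decoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead, batch_first=True),
            num_layers=dec_layers)
        self.mlm_head = nn.Linear(d_model, vocab_size)
        # Generation decoder: causal layers with cross-attention, trained
        # with denoising auto-encoding (DAE).
        self.g_decoder = nn.TransformerDecoder(
            nn.TransformerDecoderLayer(d_model, nhead, batch_first=True),
            num_layers=dec_layers)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, src_ids, tgt_ids=None):
        memory = self.encoder(self.embed(src_ids))
        # NLU path: predict masked tokens from the understanding decoder.
        mlm_logits = self.mlm_head(self.u_decoder(memory))
        gen_logits = None
        if tgt_ids is not None:
            # NLG path: autoregressive reconstruction of the clean text.
            tgt = self.embed(tgt_ids)
            seq_len = tgt.size(1)
            causal = torch.triu(
                torch.full((seq_len, seq_len), float("-inf")), diagonal=1)
            gen_logits = self.lm_head(
                self.g_decoder(tgt, memory, tgt_mask=causal))
        return mlm_logits, gen_logits

# Toy usage: one batch through both pre-training paths.
model = CPTSketch()
src = torch.randint(0, 21128, (2, 16))
tgt = torch.randint(0, 21128, (2, 16))
mlm_logits, gen_logits = model(src, tgt)
```

In this layout, a downstream NLU task would fine-tune only the encoder and understanding decoder, while an NLG task would fine-tune the encoder and generation decoder, which is the flexibility the multi-task pre-training is meant to provide.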
Some Transformer-based models can perform cross-lingual transfer learning: those models can be train...
The Transformer model is a recent, fast, and powerful advance in neural machine translation. W...
Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NM...
Recently, the development of pre-trained language models has brought natural language processing (NL...
Transformer-based neural models are used in many AI applications. Training these models is expensive...
Neural machine translation (NMT) is a data-driven machine translation approach that has proven its s...
Natural language processing (NLP) techniques have been significantly improved by the introduction of pre-trained l...
Structure prediction (SP) tasks are important in natural language understanding in the sense that th...
Machine translation has received significant attention in the field of natural language processing n...
Pre-training and fine-tuning have become the de facto paradigm in many natural language processing (...
This paper introduces the joint submission of the Beijing Jiaotong University and WeChat AI to the W...
GPT-2 and BERT demonstrate the effectiveness of using pre-trained language models (LMs) on various n...
Transformer is a neural machine translation model that revolutionized machine translation. Compared...
Pre-trained sequence-to-sequence models have significantly improved Neural Machine Translation (NMT)...
Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on...