There has been growing interest in dataset generation recently, owing to the superior generative capacity of large pre-trained language models (PLMs). In this paper, we study a flexible and efficient zero-shot learning method, ZeroGen. Given a zero-shot task, we first generate a dataset from scratch using PLMs in an unsupervised manner. Then, we train a tiny task model (e.g., LSTM) under the supervision of the synthesized dataset. This approach allows highly efficient inference, as the final task model has orders of magnitude fewer parameters compared to PLMs (e.g., GPT2-XL). Apart from being annotation-free and efficient, we argue that ZeroGen can also provide useful insights from the perspective of data-free, model-agnostic knowledge distillation...
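The two-step pipeline the abstract describes (synthesize a labeled dataset with a generator, then fit a tiny task model on it) can be sketched as follows. This is a minimal illustration, not the paper's implementation: `generate_example` is a hypothetical stub standing in for sampling from a label-conditioned PLM prompt, and the "tiny task model" here is a Naive Bayes bag-of-words classifier rather than an LSTM.

```python
import math
from collections import Counter, defaultdict

# Step 1: synthesize a dataset from scratch, conditioned on each class label.
# In the real pipeline a PLM (e.g., GPT2-XL) would sample free text from a
# label-conditioned prompt; here a deterministic template stub stands in.
POS = ["great movie loved it", "wonderful acting truly great", "loved the wonderful plot"]
NEG = ["terrible movie hated it", "awful acting truly bad", "hated the boring plot"]

def generate_example(label, i):
    # Hypothetical stub for PLM sampling: cycle through class templates.
    templates = POS if label == "pos" else NEG
    return templates[i % len(templates)]

synthetic = [(generate_example(y, i), y) for i in range(30) for y in ("pos", "neg")]

# Step 2: train a tiny task model under supervision of the synthesized data
# (here: Naive Bayes over bag-of-words counts, with add-one smoothing).
counts = defaultdict(Counter)
for text, y in synthetic:
    counts[y].update(text.split())

vocab_size = len({w for c in counts.values() for w in c})

def predict(text):
    def log_score(y):
        total = sum(counts[y].values())
        return sum(
            math.log((counts[y][w] + 1) / (total + vocab_size))
            for w in text.split()
        )
    return max(counts, key=log_score)

print(predict("loved the acting"))  # -> pos
print(predict("boring and awful"))  # -> neg
```

The point of the sketch is the division of labor: all task knowledge enters through the generator's label-conditioned sampling, so the downstream model can be arbitrarily small and fast at inference time.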
We study the problem of generating a training-free task-dependent visual classifier from text descri...
Our research focuses on solving the zero-shot text classification problem in NLP, with a particular ...
We present a deep generative model for Zero-Shot Learning (ZSL). Unlike most existing methods for th...
Nowadays, owing to the superior capacity of the large pre-trained language models (PLM), the PLM-bas...
Pretrained language models (PLMs) have demonstrated remarkable performance in various natural langua...
In recent years, the community of natural language processing (NLP) has seen amazing progress in the...
Large language models have recently been shown to attain reasonable zero-shot ...
Traditional text classification approaches often require a good amount of labeled data, which is dif...
One of the most impressive results of recent NLP history is the ability of pre-trained language mode...
We propose a novel framework for zero-shot learning of topic-dependent language models, which enable...
In any system that uses structured knowledge graph (KG) data as its underlying knowledge representati...
Providing pretrained language models with simple task descriptions in natural language enables them ...
Controlling neural network-based models for natural language generation (NLG) to realize desirable a...
Existing solutions to zero-shot text classification either conduct prompting with pre-trained langu...
In data-to-text (D2T) generation, training on in-domain data leads to overfitting to the data repres...