Nowadays, owing to the superior capacity of large pre-trained language models (PLMs), PLM-based zero-shot learning has shown promising performance on various natural language processing tasks. There is emerging interest in further exploring the zero-shot learning potential of PLMs. Among these efforts, ZeroGen attempts to use a PLM alone to generate data and train a tiny model without relying on any task-specific annotation. Despite its remarkable results, we observe that the data synthesized by the PLM contains a significant portion of low-quality samples; overfitting to such data greatly hampers the performance of the trained model and makes it unreliable for deployment. Since no gold data is accessible in the zero-shot scenario, it is hard...
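To make the ZeroGen-style pipeline concrete, below is a minimal sketch of PLM-based zero-shot dataset generation followed by training a tiny task model. It assumes GPT-2 via Hugging Face transformers as the generator, a binary sentiment task, and a bag-of-words logistic-regression classifier as the tiny model; the prompts, sample counts, and decoding settings are illustrative assumptions, not the exact setup of ZeroGen or of this work.

```python
# Minimal sketch of zero-shot dataset generation with a PLM (assumed: GPT-2,
# binary sentiment, bag-of-words logistic regression as the "tiny model").
from transformers import pipeline
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

generator = pipeline("text-generation", model="gpt2")

# Class-conditioned prompts steer the PLM to synthesize pseudo-labeled samples.
prompts = {
    1: 'The movie review in positive sentiment is: "',
    0: 'The movie review in negative sentiment is: "',
}

texts, labels = [], []
for label, prompt in prompts.items():
    outputs = generator(prompt, max_new_tokens=40, num_return_sequences=8,
                        do_sample=True, top_p=0.9)
    for out in outputs:
        # Keep only the generated continuation as a synthetic training sample.
        texts.append(out["generated_text"][len(prompt):].strip())
        labels.append(label)

# Train the tiny task model purely on synthesized data; no gold labels are used.
features = CountVectorizer().fit_transform(texts)
tiny_model = LogisticRegression(max_iter=1000).fit(features, labels)
```

In practice the generated pool would be far larger, and, as the abstract above notes, such synthetic data contains many noisy samples, which motivates the data-quality measures studied here.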
The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their gen...
Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data dist...
Pretrained language models have shown success in various areas of natural language processing, inclu...
There has been growing interest in dataset generation recently, due to the superior generative capacity o...
Pretrained language models (PLMs) have demonstrated remarkable performance in various natural langua...
Traditional text classification approaches often require a good amount of labeled data, which is dif...
We propose a multitask pretraining approach ZeroPrompt for zero-shot generalization, focusing on tas...
In recent years, the community of natural language processing (NLP) has seen amazing progress in the...
One of the most impressive results of recent NLP history is the ability of pre-trained language mode...
Existing solutions to zero-shot text classification either conduct prompting with pre-trained langu...
This paper studies the use of language models as a source of synthetic unlabeled text for NLP. We fo...
Large language models have recently been shown to attain reasonable zero-shot ...
We present a new method LiST, short for Lite Prompted Self-Training, for parameter-efficient fine-t...
High-quality instruction-tuning data is critical to improving LLM capabilities. Existing data collec...
Black-Box Tuning (BBT) is a derivative-free approach to optimize continuous prompt tokens prepended ...