Prompt-Tuning is a new paradigm for fine-tuning pre-trained language models in a parameter-efficient way. Here, we explore the use of HyperNetworks to generate hyper-prompts: we propose HyperPrompt, a novel architecture for prompt-based task-conditioning of self-attention in Transformers. The hyper-prompts are end-to-end learnable via generation by a HyperNetwork. HyperPrompt allows the network to learn task-specific feature maps where the hyper-prompts serve as task global memories for the queries to attend to, while enabling flexible information sharing among tasks. We show that HyperPrompt is competitive with strong multi-task learning baselines with as few as $0.14\%$ additional task-conditioning parameters, achieving gr...
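To make the mechanism concrete, here is a minimal sketch (in PyTorch) of how hyper-prompt conditioning of self-attention could look: a small hypernetwork maps a task embedding to key/value hyper-prompts that are prepended to the attention keys and values, so the queries attend to them as task global memories. The class name, dimensions, and the two-layer hypernetwork below are illustrative assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HyperPromptAttention(nn.Module):
    """Self-attention with task-conditioned key/value hyper-prompts (sketch)."""

    def __init__(self, d_model=512, n_heads=8, n_tasks=8, prompt_len=6, task_dim=64):
        super().__init__()
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        self.prompt_len = prompt_len
        self.qkv = nn.Linear(d_model, 3 * d_model)
        self.out = nn.Linear(d_model, d_model)
        # Task embeddings; a small hypernetwork projects them into
        # key/value hyper-prompts that act as task global memories.
        self.task_emb = nn.Embedding(n_tasks, task_dim)
        self.hypernet = nn.Sequential(
            nn.Linear(task_dim, task_dim),
            nn.ReLU(),
            nn.Linear(task_dim, 2 * prompt_len * d_model),
        )

    def forward(self, x, task_id):
        B, T, D = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)

        # Generate this task's hyper-prompts and prepend them to keys/values,
        # so every query position can attend to the task memories.
        prompts = self.hypernet(self.task_emb(task_id))            # (B, 2*L*D)
        pk, pv = prompts.view(B, 2, self.prompt_len, D).unbind(1)  # (B, L, D) each
        k = torch.cat([pk, k], dim=1)
        v = torch.cat([pv, v], dim=1)

        def heads(t):  # (B, S, D) -> (B, H, S, d_head)
            return t.view(B, t.size(1), self.n_heads, self.d_head).transpose(1, 2)

        attn = F.scaled_dot_product_attention(heads(q), heads(k), heads(v))
        return self.out(attn.transpose(1, 2).reshape(B, T, D))

# Example: a batch of 4 sequences, each tagged with a task id.
layer = HyperPromptAttention()
x = torch.randn(4, 128, 512)
y = layer(x, task_id=torch.tensor([0, 0, 1, 2]))  # -> (4, 128, 512)
```

Because only the task embeddings and the hypernetwork are task-specific, the added parameter count stays small relative to the backbone, which is the source of the parameter efficiency claimed above.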
Transformer networks have seen great success in natural language processing and machine vision, wher...
Transformer models cannot easily scale to long sequences due to their $O(N^2)$ time and space complexi...
Prompting has shown impressive success in enabling large pretrained language models (LMs) to perform...
We propose structured prompt tuning, a simple and effective method to improve prompt tuning. Instead...
Transformer-based architectures are the model of choice for natural language understanding, but they...
We investigate input-conditioned hypernetworks for multi-tasking in NLP, generating parameter-effici...
Speech representations learned from Self-supervised learning (SSL) models can benefit various speech...
Parameter-efficient fine-tuning (PEFT) has shown its effectiveness in adapting the pre-trained langu...
Fine-tuning large language models for different tasks can be costly and inefficient, and even method...
Prompt tuning learns soft prompts to condition frozen Pre-trained Language Models (PLMs) for perform...
Prompt tuning (PT) is an effective approach to adapting pre-trained language models to downstream ta...
We evaluate three simple, normalization-centric changes to improve Transformer training. First, we s...
Recent works have shown promising results of prompt tuning in stimulating pre-trained language model...
The transformer architecture and variants presented remarkable success across many machine learning ...
The current modus operandi in adapting pre-trained models involves updating all the backbone paramet...