Parameter-efficient fine-tuning (PEFT) methods can adapt large language models to downstream tasks by training a small number of newly added parameters. In multi-task settings, PEFT adapters typically train on each task independently, inhibiting transfer across tasks, or on the concatenation of all tasks, which can lead to negative interference. To address this, Polytropon (Ponti et al.) jointly learns an inventory of PEFT adapters and a routing function to share variable-size sets of adapters across tasks. Subsequently, adapters can be re-combined and fine-tuned on novel tasks even with limited data. In this paper, we investigate to what extent the ability to control which adapters are active for each task leads to sample-efficient general...
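To make the routing idea above concrete, here is a minimal sketch of Polytropon-style skill routing over an inventory of LoRA adapters. It is an illustrative assumption, not the authors' implementation: the class name `SkillRoutedLinear`, the sigmoid routing, and the hyperparameters are hypothetical, and the real method applies such routing across many layers and tasks.

```python
# Minimal sketch (assumption): a linear layer whose LoRA update is a
# task-routed mix of K "skill" adapters, in the spirit of Polytropon.
import torch
import torch.nn as nn


class SkillRoutedLinear(nn.Module):
    """Frozen base weight plus a task-routed combination of low-rank adapters."""

    def __init__(self, d_in, d_out, n_tasks, n_skills, rank=4):
        super().__init__()
        self.base = nn.Linear(d_in, d_out)          # stands in for a pretrained weight
        self.base.weight.requires_grad_(False)
        self.base.bias.requires_grad_(False)
        # One low-rank adapter (A, B) per skill in the inventory.
        self.A = nn.Parameter(torch.randn(n_skills, d_in, rank) * 0.01)
        self.B = nn.Parameter(torch.zeros(n_skills, rank, d_out))
        # Learned task-to-skill routing logits (which adapters each task uses).
        self.router = nn.Parameter(torch.zeros(n_tasks, n_skills))

    def forward(self, x, task_id):
        # Soft routing weights over the skill inventory for this task.
        w = torch.sigmoid(self.router[task_id])
        w = w / (w.sum() + 1e-6)
        # Mix the low-rank factors according to the routing weights.
        A = torch.einsum("s,sir->ir", w, self.A)
        B = torch.einsum("s,sro->ro", w, self.B)
        return self.base(x) + x @ A @ B


layer = SkillRoutedLinear(d_in=768, d_out=768, n_tasks=8, n_skills=4)
out = layer(torch.randn(2, 768), task_id=3)
print(out.shape)  # torch.Size([2, 768])
```

Because only the adapters and the routing logits require gradients, adapting to a novel task amounts to learning (or re-combining) a new row of routing weights and lightly fine-tuning the selected adapters.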
Standard fine-tuning of large pre-trained language models (PLMs) for downstream tasks requires updat...
Language model fine-tuning is essential for modern natural language processing, but is computational...
In this paper, we move towards combining large parametric models with non-parametric prototypical ne...
Parameter-efficient fine-tuning (PEFT) has shown its effectiveness in adapting the pre-trained langu...
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
Adapting large-scale pretrained models to various downstream tasks via fine-tuning is a standard met...
Fine-tuning the entire set of parameters of a large pretrained model has become the mainstream appro...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
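The BitFit recipe is simple enough to show in a short sketch. The snippet below is a hedged illustration of the typical setup, not the paper's released code: the helper name `apply_bitfit`, the toy encoder layer, and the optimizer settings are assumptions; the core idea is that only parameters whose name contains "bias" remain trainable.

```python
# Minimal sketch (assumption): BitFit-style sparse fine-tuning that freezes
# every parameter except bias terms.
import torch
from torch import nn


def apply_bitfit(model: nn.Module) -> nn.Module:
    """Freeze all parameters except bias terms."""
    for name, param in model.named_parameters():
        param.requires_grad = "bias" in name
    return model


# Example on a toy Transformer encoder layer.
layer = nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True)
apply_bitfit(layer)

trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
print(f"trainable: {trainable} / {total} parameters")

# Only the trainable (bias) parameters are handed to the optimizer.
optimizer = torch.optim.AdamW(
    [p for p in layer.parameters() if p.requires_grad], lr=1e-4
)
```

The printed ratio makes the parameter savings explicit: biases account for a very small fraction of the layer's parameters, which is what makes the method attractive for storage-constrained multi-task deployment.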
Large language models (LLMs) and vision language models (VLMs) demonstrate excellent performance on ...
Parameter-efficient fine-tuning methods (PEFTs) offer the promise of adapting large pre-trained mode...
Fine-tuning BERT-based models is resource-intensive in memory, computation, and time. While many pri...
The current modus operandi in adapting pre-trained models involves updating all the backbone paramet...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
We present Generalized LoRA (GLoRA), an advanced approach for universal parameter-efficient fine-tun...
Transformer-based pre-trained models with millions of parameters require large storage. Recent appro...