Fine-tuning the entire set of parameters of a large pretrained model has become the mainstream approach for transfer learning. To increase its efficiency and prevent catastrophic forgetting and interference, techniques like adapters and sparse fine-tuning have been developed. Adapters are modular, as they can be combined to adapt a model towards different facets of knowledge (e.g., dedicated language and/or task adapters). Sparse fine-tuning is expressive, as it controls the behavior of all model components. In this work, we introduce a new fine-tuning method with both these desirable properties. In particular, we learn sparse, real-valued masks based on a simple variant of the Lottery Ticket Hypothesis. Task-specific masks are obtained fro...
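The abstract above describes learning sparse, real-valued masks via a Lottery Ticket-style procedure. Below is a minimal sketch of one common reading of that recipe: fully fine-tune once, keep only the k parameters that moved most, then fine-tune again with gradients restricted to those parameters. It assumes a PyTorch model; the function names (`lottery_ticket_sparse_masks`, `apply_mask_to_grads`) and the global top-k selection are illustrative assumptions, not details taken from the truncated abstract.

```python
import torch

def lottery_ticket_sparse_masks(pretrained, finetuned, k):
    """Select the k parameters that changed most during an initial full fine-tuning.

    `pretrained` and `finetuned` are dicts of parameter tensors (e.g. state_dicts).
    Returns one binary mask per tensor; a second fine-tuning pass would only be
    allowed to update the masked entries.
    """
    # Absolute change of every parameter after the first (full) fine-tuning pass.
    deltas = {name: (finetuned[name] - pretrained[name]).abs()
              for name in pretrained}
    # Global threshold: keep the k largest changes across all tensors.
    all_changes = torch.cat([d.flatten() for d in deltas.values()])
    threshold = torch.topk(all_changes, k).values.min()
    return {name: (d >= threshold).float() for name, d in deltas.items()}

def apply_mask_to_grads(model, masks):
    """Zero gradients outside the mask so only the selected weights are updated."""
    for name, param in model.named_parameters():
        if param.grad is not None and name in masks:
            param.grad.mul_(masks[name])
```

After the masked second pass, the difference between the fine-tuned and pretrained weights is nonzero only under the mask, so it can be stored as a sparse vector; composing several such sparse differences by addition is one plausible way to combine the task- and language-specific masks the abstract mentions (the exact composition rule is cut off in the text above).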
Parameter-efficient fine-tuning (PEFT) methods can adapt large language models to downstream tasks b...
Prior work shows that it is possible to expand pretrained Masked Language Models (MLMs) to new langu...
Fine-tuning pre-trained models has been ubiquitously proven to be effective in a wide range of NLP t...
Gigantic pre-trained models have become central to natural language processing (NLP), serving as the...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
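As a point of comparison, bias-only fine-tuning as described in the BitFit abstract is easy to sketch: freeze everything except the bias terms and pass only those to the optimizer. The snippet below assumes a PyTorch model; selecting parameters by the substring "bias" in their name is an illustrative heuristic, not necessarily the paper's exact procedure.

```python
import torch

def freeze_all_but_biases(model: torch.nn.Module):
    """Freeze every parameter except bias terms (bias-only / BitFit-style fine-tuning)."""
    trainable = []
    for name, param in model.named_parameters():
        param.requires_grad = "bias" in name
        if param.requires_grad:
            trainable.append(param)
    return trainable

# Only bias parameters are handed to the optimizer, so all other weights
# remain at their pretrained values during fine-tuning:
# optimizer = torch.optim.AdamW(freeze_all_but_biases(model), lr=1e-4)
```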
We consider the problem of accurate sparse fine-tuning of large language models (LLMs), that is, fin...
The pre-training and fine-tuning paradigm has contributed to a number of breakthroughs in Natural La...
With the increasing prevalence of Large Language Models, traditional full fine-tuning approaches fac...
Pre-trained multilingual language models show significant performance gains for zero-shot cross-ling...
Large language models (LLMs) and vision language models (VLMs) demonstrate excellent performance on ...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
Fine-tuning BERT-based models is resource-intensive in memory, computation, and time. While many pri...
Language model fine-tuning is essential for modern natural language processing, but is computational...
A recent family of techniques, dubbed as lightweight fine-tuning methods, facilitates parameter-effi...