Large, pre-trained models are problematic to use in resource-constrained applications. Fortunately, task-aware structured pruning methods offer a solution. These approaches reduce model size by dropping structural units such as layers and attention heads in a manner that takes the end task into account. However, these pruning algorithms require more task-specific data than is typically available. We propose a framework that combines structured pruning with transfer learning to reduce the need for task-specific data. Our empirical results answer questions such as: How should the two tasks be coupled? What parameters should be transferred? And when during training should transfer learning be introduced? Leveraging these insights, we demonstrat...
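For concreteness, the sketch below illustrates one common form of task-aware structured pruning: attention heads are gated, each head is scored by the sensitivity of the task loss to its gate (a gradient-based proxy in the spirit of Michel et al., 2019, "Are Sixteen Heads Really Better than One?"), and the lowest-scoring heads are masked out. This is a minimal illustration under stated assumptions, not the paper's exact algorithm; the module, gate, and helper names are hypothetical.

import torch
import torch.nn as nn

class GatedMultiHeadAttention(nn.Module):
    """Self-attention with one differentiable gate per head (illustrative)."""
    def __init__(self, d_model=64, n_heads=8):
        super().__init__()
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)
        self.out = nn.Linear(d_model, d_model)
        # One gate per head; structured pruning sets a gate to exactly 0,
        # which removes that head's entire contribution.
        self.head_gates = nn.Parameter(torch.ones(n_heads))

    def forward(self, x):  # x: (batch, seq, d_model)
        b, s, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        shape = (b, s, self.n_heads, self.d_head)
        # Reshape each of q, k, v to (batch, heads, seq, d_head).
        q, k, v = (t.reshape(shape).transpose(1, 2) for t in (q, k, v))
        att = torch.softmax(q @ k.transpose(-2, -1) / self.d_head ** 0.5, dim=-1)
        ctx = att @ v  # (batch, heads, seq, d_head)
        # Gate each head's output before mixing heads back together.
        ctx = ctx * self.head_gates.view(1, -1, 1, 1)
        return self.out(ctx.transpose(1, 2).reshape(b, s, -1))

def head_importance(model, loss_fn, batch):
    """Task-aware proxy score: |d(task loss)/d(gate)| for each head."""
    model.zero_grad()
    loss_fn(model, batch).backward()
    return model.head_gates.grad.abs()

# Usage sketch on random data with a toy regression loss standing in for
# the (data-scarce) end task.
model = GatedMultiHeadAttention()
x, y = torch.randn(4, 10, 64), torch.randn(4, 10, 64)
loss_fn = lambda m, b: nn.functional.mse_loss(m(b[0]), b[1])
scores = head_importance(model, loss_fn, (x, y))
# Structured pruning step: zero the gates of the 2 least task-relevant heads.
with torch.no_grad():
    model.head_gates[scores.topk(2, largest=False).indices] = 0.0

Because the scores are computed from gradients of the end-task loss, the same scoring pass could be run on a data-rich source task instead, which is the kind of coupling between pruning and transfer learning that the abstract's questions (how to couple the tasks, what to transfer, and when) are probing.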