Transformer-based models achieve state-of-the-art performance on various deep learning tasks. Because these models have large numbers of parameters, fine-tuning them on downstream tasks is computationally intensive and energy-hungry. Automatic mixed-precision FP32/FP16 fine-tuning of such models has previously been used to lower compute resource requirements. However, recent advances in low-bit integer back-propagation make it possible to further reduce the computation and memory footprint. In this work, we explore a novel integer training method that uses integer arithmetic for both forward propagation and gradient computation of linear, convolutional, layer-norm, and embedding layers in transformer-b...
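The abstract above only states that forward propagation and gradient computation use integer arithmetic; it does not specify the quantization scheme. As a minimal sketch, assuming symmetric per-tensor int8 quantization with int32 accumulation (an assumption, not the paper's actual method), an integer linear layer's forward and backward passes could look like:

```python
import numpy as np

def quantize(x, bits=8):
    """Symmetric per-tensor quantization to signed integers (assumed scheme)."""
    qmax = 2 ** (bits - 1) - 1
    amax = np.max(np.abs(x))
    scale = amax / qmax if amax > 0 else 1.0
    q = np.clip(np.round(x / scale), -qmax, qmax).astype(np.int32)
    return q, scale

def int_linear_forward(x, w):
    """y = x @ w.T computed with an integer matmul, then rescaled to float."""
    qx, sx = quantize(x)
    qw, sw = quantize(w)
    acc = qx @ qw.T                      # int32 accumulation
    return acc.astype(np.float32) * (sx * sw)

def int_linear_backward(grad_y, x, w):
    """Gradients w.r.t. x and w, also computed via integer matmuls."""
    qg, sg = quantize(grad_y)
    qx, sx = quantize(x)
    qw, sw = quantize(w)
    grad_x = (qg @ qw).astype(np.float32) * (sg * sw)
    grad_w = (qg.T @ qx).astype(np.float32) * (sg * sx)
    return grad_x, grad_w

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 16)).astype(np.float32)
w = rng.standard_normal((8, 16)).astype(np.float32)
y = int_linear_forward(x, w)
gx, gw = int_linear_backward(np.ones_like(y), x, w)
```

The key point is that the expensive matrix multiplications run entirely in integer arithmetic, with floating-point scales applied only once per tensor afterwards; the same pattern would extend to the convolutional, layer-norm, and embedding layers the abstract mentions.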
There has been an explosion of interest in designing high-performance Transformers. While Transforme...
Recent advances in deep learning have been driven by ever-increasing model sizes, with networks grow...
Network quantization significantly reduces model inference complexity and has been widely used in re...
The ever-increasing computational complexity of deep learning models makes their training and deploy...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
Vision Transformers (ViTs) have achieved state-of-the-art performance on various computer vision app...
Narrow bit-width data formats are key to reducing the computational and storage costs of modern deep...
The general trend in NLP is towards increasing model capacity and performance via deeper neural netw...
The current modus operandi in adapting pre-trained models involves updating all the backbone paramet...
The Transformer architecture has revolutionized deep learning on sequential data, becoming ubiquitou...
Transformer-based neural models are used in many AI applications. Training these models is expensive...
Transformer models are widely used in AI applications such as Natural Language Processing (NLP), Com...
Limited computational budgets often prevent transformers from being used in production and from havi...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
Attention-based neural networks such as the Vision Transformer (ViT) have recently attained state-of...