Given a large Transformer model, how can we obtain a small and computationally efficient model which maintains the performance of the original model? Transformers have shown significant performance improvements on many NLP tasks in recent years. However, their large size, expensive computational cost, and long inference time make it challenging to deploy them to resource-constrained devices. Existing Transformer compression methods mainly focus on reducing the size of the encoder, ignoring the fact that the decoder accounts for the major portion of the long inference time. In this paper, we propose PET (Parameter-Efficient knowledge distillation on Transformer), an efficient Transformer compression method that reduces the size of both the encoder and the decoder.
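As a concrete but simplified illustration of the knowledge-distillation setup that compression methods of this kind build on, the sketch below trains a smaller student model against a larger teacher with a combined soft-label/hard-label objective. This is a minimal generic sketch, not PET's actual algorithm; the function name `distillation_loss` and the `temperature`, `alpha`, and `pad_id` values are illustrative assumptions.

```python
# Minimal sketch of knowledge distillation for a sequence model (e.g. an
# encoder-decoder Transformer). Generic soft-label KD, NOT the PET objective.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5, pad_id=0):
    """Mix a soft-label KL term (teacher -> student) with hard-label CE.

    student_logits, teacher_logits: (batch, tgt_len, vocab)
    labels: (batch, tgt_len) gold target token ids
    """
    # Soft targets: pull the student's output distribution toward the teacher's.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)

    # Hard targets: ordinary cross-entropy against the gold output tokens.
    hard = F.cross_entropy(
        student_logits.reshape(-1, student_logits.size(-1)),
        labels.reshape(-1),
        ignore_index=pad_id,
    )
    return alpha * soft + (1.0 - alpha) * hard
```

In a typical training loop, the teacher is run in evaluation mode with gradients disabled to produce `teacher_logits`, while only the compact student's parameters are updated with this loss.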