Transformers have been established as one of the most effective neural approaches for a wide range of Natural Language Processing tasks. However, following the common trend in modern deep architectures, their scale has grown so quickly that training such models from scratch is no longer a realistic option for many enterprises. Indeed, despite their strong performance, Transformers have the general drawback of requiring huge amounts of training data, computational resources, and energy to be optimized successfully. For this reason, more recent architectures such as Bidirectional Encoder Representations from Transformers (BERT) rely on unlabeled data to pre-train the model, which is later fine-tuned for a specific downstream ...
Introducing factors, that is to say, word features such as linguistic information referring to the s...
Transformers are powerful for sequence modeling. Nearly all state-of-the-art language models and pre...
Given a large Transformer model, how can we obtain a small and computationally efficient model which...
Natural language processing (NLP) techniques have improved significantly with the introduction of pre-trained l...
Introducing factors such as linguistic features has long been proposed in machine translation to imp...
The utility of linguistic annotation in neural machine translation seemed to have been established in...
This chapter presents an overview of the state of the art in natural language processing, exploring ...
The Transformer model is a recent, fast, and powerful advance in neural machine translation. W...
Language Generation Models produce words based on the previous context. Although existing methods of...
Data augmentation methods for Natural Language Processing tasks have been explored in recent years; howeve...
End-to-end neural machine translation does not require us to have specialized knowledge of investiga...
In the last decade, the size of deep neural architectures employed in Natural Language Processing (NL...
The goal of my thesis is to investigate the most influential transformer architectures and to apply ...
We propose a novel Transformer encoder-based architecture with syntactical knowledge encoded for int...