Large pre-trained language models such as BERT have been the driving force behind recent improvements across many NLP tasks. However, BERT is trained only on self-supervised objectives, masked word prediction and next sentence prediction, and has no explicit knowledge of lexical, syntactic or semantic information beyond what it picks up through unsupervised pre-training. We propose a novel method to explicitly inject linguistic information, in the form of word embeddings, into any layer of a pre-trained BERT. When injecting counter-fitted and dependency-based embeddings, the performance improvements on multiple semantic similarity datasets indicate that such information is beneficial and currently missing from the original model. Our qualitative analysis ...
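The abstract does not spell out the injection mechanism, so the sketch below shows just one plausible way to combine external word vectors with a pre-trained BERT via the HuggingFace transformers API: project the external vectors to BERT's hidden size and add them to the input-layer embeddings. The projection layer, the random external_vectors placeholder and the choice of the input layer are assumptions made for illustration, not the paper's actual method.

```python
# Hypothetical sketch: injecting external (e.g. counter-fitted) word vectors
# into BERT's input representations. All names below are illustrative.
import torch
import torch.nn as nn
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
bert = BertModel.from_pretrained("bert-base-uncased")

ext_dim = 300                                        # dimensionality of the external embeddings
proj = nn.Linear(ext_dim, bert.config.hidden_size)   # learned projection to BERT's hidden size

enc = tokenizer("the cat sat on the mat", return_tensors="pt")

# BERT's own (sub)word embeddings for the input tokens: (1, seq_len, hidden_size)
bert_embeds = bert.get_input_embeddings()(enc["input_ids"])

# Placeholder external vectors, one per token; in practice these would be
# looked up from counter-fitted or dependency-based embeddings and aligned
# to BERT's subword tokenisation.
external_vectors = torch.randn(enc["input_ids"].shape[1], ext_dim)

# Inject by adding the projected external vectors to BERT's embeddings,
# then feed the combined representation through the encoder.
combined = bert_embeds + proj(external_vectors).unsqueeze(0)
outputs = bert(inputs_embeds=combined, attention_mask=enc["attention_mask"])
```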
Word vector representations play a fundamental role in many NLP applications. ...
Recently, the development of pre-trained language models has brought natural language processing (NL...
Several studies investigated the linguistic information implicitly encoded in Neural Language Models...
Pretraining deep language models has led to large performance gains in NLP. Despite this success, Sc...
Unsupervised pretraining models have been shown to facilitate a wide range of downstream NLP applica...
Though achieving impressive results on many NLP tasks, the BERT-like masked language models (MLM) en...
We tackle the problem of identifying metaphors in text, treated as a sequence tagging task. The pre-...
Contextualized word embeddings, i.e. vector representations for words in context, are naturally seen...
Semantic similarity detection is a fundamental task in natural language understanding. Adding topic ...
One of the most remarkable properties of word embeddings is the fact that they capture certain types...
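The entry above is cut off, but the property it alludes to is commonly illustrated by the vector-offset regularity of word embeddings. The toy example below is purely illustrative (the vectors are made up, not taken from any paper) and only demonstrates the analogy-by-arithmetic idea:

```python
# Illustrative sketch of the vector-offset regularity often cited for word
# embeddings (e.g. king - man + woman ≈ queen). The toy vectors are invented
# for demonstration; real embeddings would come from a trained model such as
# word2vec or GloVe.
import numpy as np

toy_vectors = {
    "king":  np.array([0.8, 0.9, 0.1]),
    "man":   np.array([0.7, 0.1, 0.0]),
    "woman": np.array([0.7, 0.1, 0.9]),
    "queen": np.array([0.8, 0.9, 1.0]),
    "apple": np.array([0.1, 0.5, 0.3]),
}

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Solve "man is to king as woman is to ?" by vector arithmetic, excluding
# the query words themselves from the candidate set.
target = toy_vectors["king"] - toy_vectors["man"] + toy_vectors["woman"]
candidates = {w: v for w, v in toy_vectors.items() if w not in {"king", "man", "woman"}}
best = max(candidates, key=lambda w: cosine(target, candidates[w]))
print(best)  # -> "queen" with these toy vectors
```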
Modern text classification models are susceptible to adversarial examples, perturbed versions of the...
Language model-based pre-trained representations have become ubiquitous in nat...
Large pretrained masked language models have become state-of-the-art solutions for many NLP problems...
BERT has achieved impressive performance in several NLP tasks. However, there has been limited inves...
Pretraining deep neural network architectures with a language modeling objective has brought large i...