Transformers are the current state of the art in natural language processing across many domains and are gaining traction within software engineering research as well. Such models are pre-trained on large amounts of data, usually from the general domain. However, we have only a limited understanding of the validity of transformers within the software engineering domain, i.e., how well such models understand words and sentences in a software engineering context and how this improves the state of the art. Within this article, we shed light on this complex but crucial issue. We compare BERT transformer models trained on software engineering data with transformers based on general-domain data along multiple dimensions: their voca...
This document aims to be a self-contained, mathematically precise overview of transformer architectu...
Natural language generation (NLG) systems are computer software systems that produce texts in Engli...
This paper reports on the evaluation of Deep Learning (DL) transformer architecture models for Named...
The Bidirectional Encoder Representations from Transformers (BERT) is currently one of the most impo...
In the last decade, the size of deep neural architectures employed in Natural Language Processing (NL...
The software development process produces vast amounts of textual data expressed in natural language...
Natural language processing (NLP) techniques have significantly improved with the introduction of pre-trained l...
In today’s world, which is full of innovations in various fields, the role of Information Technologi...
Pre-trained transformers have rapidly become very popular in the Natural Language Processing (NLP) c...
Large Transformer models have achieved state-of-the-art status for Natural Language Understanding tas...
Software development is a complex activity that requires, in addition to professional knowledge and ...
Recently, the development of pre-trained language models has brought natural language processing (NL...
Transformer-based masked language models trained on general corpora, such as BERT and RoBERTa, have ...
Artificial intelligence (AI) for software engineering (SE) tasks has recently achieved promising per...