The article is an essay on the development of the natural language processing technologies that formed the basis of BERT (Bidirectional Encoder Representations from Transformers), a language model from Google that shows strong results across a whole class of natural language understanding problems. Two key ideas implemented in BERT are knowledge transfer and the attention mechanism. The model is pretrained on two tasks over a large unlabeled data set and can reuse the language patterns it identifies for effective training on a specific text processing problem. The Transformer architecture is built on the attention mechanism, i.e. it evaluates the relationships between the tokens of the input data. In addition, the article notes ...
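Since the abstract turns on the attention mechanism, i.e. scoring the relationships between input tokens, a minimal sketch of scaled dot-product attention may help make that concrete. Everything below (the function name, shapes, and the NumPy rendering) is an illustrative assumption, not code from the article:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Mix each token's value vector according to its relevance to every query token.

    Q, K, V: arrays of shape (seq_len, d_k), the query, key, and value
    projections of the input token embeddings.
    """
    d_k = Q.shape[-1]
    # Pairwise relevance scores between tokens, scaled to keep softmax stable.
    scores = Q @ K.T / np.sqrt(d_k)
    # Normalize the scores into attention weights over the sequence (softmax).
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output token is a relevance-weighted mixture of all value vectors.
    return weights @ V

# Toy usage: 4 tokens with 8-dimensional projections.
rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(4, 8)) for _ in range(3))
out = scaled_dot_product_attention(Q, K, V)  # shape (4, 8)
```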
In the last few decades, text mining has been used to extract knowledge from free texts. Applying ne...
Recently, transformer-based pretrained language models have demonstrated stellar performance in natu...
Since the first bidirectional deep learning model for natural language understanding, BERT, emerge...
Transfer learning applies knowledge or patterns learned in a particular field or task to differe...
These improvements open many possibilities in solving Natural Language Processing downstream tasks. ...
In transfer learning, two major activities, i.e., pretraining and fine-tuning, are carried out to pe...
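As a concrete illustration of the pretrain-then-fine-tune workflow this snippet describes, here is a minimal sketch using the Hugging Face transformers library; the checkpoint name, label count, and example sentence are illustrative assumptions rather than details from the paper:

```python
import torch
from transformers import BertForSequenceClassification, BertTokenizer

# Load weights learned during pretraining and attach a fresh, randomly
# initialized classification head for the downstream task.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# One fine-tuning step on a single (hypothetical) labeled example.
inputs = tokenizer("This movie was great!", return_tensors="pt")
labels = torch.tensor([1])  # assumed positive-sentiment label
outputs = model(**inputs, labels=labels)
outputs.loss.backward()  # gradients also flow into the pretrained encoder
```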
This chapter presents an overview of the state of the art in natural language processing, exploring ...
Language model pre-training architectures have proven useful for learning language represent...
Natural language processing (NLP) techniques have significantly improved with the introduction of pre-trained l...
In published reviews of natural language pre-training technology, most of the literature only elaborat...
Thesis (Master's), University of Washington, 2020. This thesis presents a study that was designed to t...
In 2017, Vaswani et al. proposed a new neural network architecture named Transformer. That modern ar...
Unsupervised learning of text representations aims at converting natural language into vector represen...
Bidirectional Encoder Representations from Transformers (BERT) is a transformer-based machine learni...
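To make BERT's masked-language-modeling pretraining objective concrete, here is a minimal sketch with the Hugging Face fill-mask pipeline; the checkpoint and the example sentence are illustrative assumptions:

```python
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# BERT predicts the hidden token from context on both sides of the mask,
# the bidirectional conditioning the name refers to.
for candidate in fill_mask("The capital of France is [MASK]."):
    print(candidate["token_str"], round(candidate["score"], 3))
```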