Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT

Voita, Elena
Sennrich, Rico
Titov, Ivan

Publication date

November 2021

DOI

10.5167/uzh-208888

Publisher

ACL Anthology

Language

English

Abstract

Unlike traditional statistical machine translation (SMT), which decomposes the translation task into distinct, separately learned components, neural machine translation (NMT) uses a single neural network to model the entire translation process. Although NMT is the de facto standard, it is still not clear how NMT models acquire different competences over the course of training, and how this mirrors the different models in traditional SMT. In this work, we look at the competences related to three core SMT components and find that during training, NMT first focuses on learning target-side language modeling, then improves translation quality approaching word-by-word translation, and finally learns more complicated reordering patterns. We sho...
