This paper considers continual learning of a large-scale pretrained neural machine translation model without accessing the previous training data or introducing model separation. We argue that the widely used regularization-based methods, which perform multi-objective learning with an auxiliary loss, suffer from the misestimation problem and cannot always achieve a good balance between the previous and new tasks. To solve the problem, we propose a two-stage training method based on the local features of the real loss. We first search for low-forgetting-risk regions, where the model can retain its performance on the previous task as the parameters are updated, so as to avoid the catastrophic forgetting problem. Then we can continually train the model with...
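For illustration only, the sketch below shows one way the second training stage could look in PyTorch: fine-tuning on the new-task data while projecting each parameter back into a per-parameter interval around its pretrained value. The region-search stage, the `risk_radius` dictionary, the model interface, and the data loader are all assumptions for the sketch, not the authors' implementation.

```python
import torch


def train_within_region(model, pretrained_state, risk_radius, data_loader, num_epochs=1):
    """Stage-two sketch: fine-tune on new-task data while keeping every
    parameter inside an assumed low-forgetting-risk interval around its
    pretrained value.

    pretrained_state : dict mapping parameter name -> pretrained tensor
    risk_radius      : dict mapping parameter name -> per-parameter half-width
                       (hypothetical output of the region-search stage)
    """
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
    model.train()
    for _ in range(num_epochs):
        for batch in data_loader:
            optimizer.zero_grad()
            loss = model(**batch).loss  # assumes a HuggingFace-style seq2seq model
            loss.backward()
            optimizer.step()
            # Project the updated parameters back into the region so that
            # previous-task performance is (approximately) preserved.
            with torch.no_grad():
                for name, param in model.named_parameters():
                    lo = pretrained_state[name] - risk_radius[name]
                    hi = pretrained_state[name] + risk_radius[name]
                    param.copy_(torch.minimum(torch.maximum(param, lo), hi))
    return model
```

The projection step is only one possible reading of "training within a low forgetting risk region"; the paper's actual region definition and search procedure may differ.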
Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being...
Pretrained language models (PLMs) are today the primary model for natural language processing. Despi...
Continual learning is a framework of learning in which we aim to move beyond the limitations of stan...
Recent literature has demonstrated the potential of multilingual Neural Machine Translation (mNMT) m...
GPT-2 and BERT demonstrate the effectiveness of using pre-trained language models (LMs) on various n...
The lifelong learning paradigm in machine learning is an attractive alternative to the more prominen...
Multilingual speech recognition with neural networks is often implemented with batch-learning, when ...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
Kenneweg P, Schulz A, Schroeder S, Hammer B. Intelligent Learning Rate Distribution to Reduce Catast...
Pre-trained models are nowadays a fundamental component of machine learning research. In continual l...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Improving machine translation (MT) by learning from human post-edits is a powerful solution that is ...
Neural machine translation is known to require large numbers of parallel training sentences, which g...
The Contrastive Language-Image Pre-training (CLIP) Model is a recently proposed large-scale pre-trai...
We analyze the learning dynamics of neural language and translation models using Loss Change Allocat...