The standard training algorithm in neural machine translation (NMT) suffers from exposure bias, and alternative algorithms have been proposed to mitigate this. However, the practical impact of exposure bias is under debate. In this paper, we link exposure bias to another well-known problem in NMT, namely the tendency to generate hallucinations under domain shift. In experiments on three datasets with multiple test domains, we show that exposure bias is partially to blame for hallucinations, and that training with Minimum Risk Training, which avoids exposure bias, can mitigate this. Our analysis explains why exposure bias is more problematic under domain shift, and also links exposure bias to the beam search problem, i.e. performance deterio...
The 2020 WMT Biomedical translation task evaluated Medline abstract translations. This is a small-do...
Recent studies have revealed a number of pathologies of neural machine translation (NMT) systems. Hy...
This paper considers continual learning of large-scale pretrained neural machine translation model w...
The standard training algorithm in neural machine translation (NMT) suffers from exposure bias, and ...
Translating text that diverges from the training domain is a key challenge for machine translation. ...
Neural Machine Translation (NMT) currently exhibits biases such as producing translations that are t...
Current language generation models suffer from issues such as repetition, incoherence, and hallucina...
Neural metrics have achieved impressive correlation with human judgements in the evaluation of machi...
Word sense disambiguation is a well-known source of translation errors in NMT. We posit that some of...
We report on search errors and model errors in neural machine translation (NMT). We present an exact...
In this thesis, I explore neural machine translation (NMT) models via targeted investigation of vari...
The advancement of neural network models has led to state-of-the-art performance in a wide range of ...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
This thesis aims for general robust Neural Machine Translation (NMT) that is agnostic to the test do...
The effect of translationese has been studied in the field of machine translation (MT), mostly with ...
The 2020 WMT Biomedical translation task evaluated Medline abstract translations. This is a small-do...
Recent studies have revealed a number of pathologies of neural machine translation (NMT) systems. Hy...
This paper considers continual learning of large-scale pretrained neural machine translation model w...
The standard training algorithm in neural machine translation (NMT) suffers from exposure bias, and ...
Translating text that diverges from the training domain is a key challenge for machine translation. ...
Neural Machine Translation (NMT) currently exhibits biases such as producing translations that are t...
Current language generation models suffer from issues such as repetition, incoherence, and hallucina...
Neural metrics have achieved impressive correlation with human judgements in the evaluation of machi...
Word sense disambiguation is a well-known source of translation errors in NMT. We posit that some of...
We report on search errors and model errors in neural machine translation (NMT). We present an exact...
In this thesis, I explore neural machine translation (NMT) models via targeted investigation of vari...
The advancement of neural network models has led to state-of-the-art performance in a wide range of ...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
This thesis aims for general robust Neural Machine Translation (NMT) that is agnostic to the test do...
The effect of translationese has been studied in the field of machine translation (MT), mostly with ...
The 2020 WMT Biomedical translation task evaluated Medline abstract translations. This is a small-do...
Recent studies have revealed a number of pathologies of neural machine translation (NMT) systems. Hy...
This paper considers continual learning of large-scale pretrained neural machine translation model w...