Neural metrics have achieved impressive correlation with human judgements in the evaluation of machine translation systems, but before we can safely optimise towards such metrics, we should be aware of (and ideally eliminate) biases toward bad translations that receive high scores. Our experiments show that sample-based Minimum Bayes Risk decoding can be used to explore and quantify such weaknesses. When applying this strategy to COMET for en-de and de-en, we find that COMET models are not sensitive enough to discrepancies in numbers and named entities. We further show that these biases are hard to fully remove by simply training on additional synthetic data and release our code and data for facilitating further experiments
We present a comparison of automatic metrics against human evaluations of translation quality in sev...
This article reports a multifaceted comparison between statistical and neural machine translation (...
We present a novel scheme to combine neural machine translation (NMT) with traditional statistical m...
Neural metrics have achieved impressive correlation with human judgements in the evaluation of machi...
Neural metrics have achieved impressive correlation with human judgements in the evaluation of machi...
Neural Machine Translation (NMT) currently exhibits biases such as producing translations that are t...
Automatic metrics play a crucial role in machine translation. Despite the widespread use of n-gram-b...
Recent studies have revealed a number of pathologies of neural machine translation (NMT) systems. Hy...
Word sense disambiguation is a well-known source of translation errors in NMT. We posit that some of...
We present a Minimum Bayes Risk (MBR) decoder for statistical machine translation. The approach aims...
Translating text that diverges from the training domain is a key challenge for machine translation. ...
Solid evaluation of neural machine translation (NMT) is key to its understanding and improvement. Cu...
Automatic Machine Translation (MT) evaluation is an active field of research, with a handful of new ...
Recent research suggests that neural machine translation achieves parity with professional human tra...
We present Minimum Bayes-Risk (MBR) de-coding for statistical machine translation. This statistical ...
We present a comparison of automatic metrics against human evaluations of translation quality in sev...
This article reports a multifaceted comparison between statistical and neural machine translation (...
We present a novel scheme to combine neural machine translation (NMT) with traditional statistical m...
Neural metrics have achieved impressive correlation with human judgements in the evaluation of machi...
Neural metrics have achieved impressive correlation with human judgements in the evaluation of machi...
Neural Machine Translation (NMT) currently exhibits biases such as producing translations that are t...
Automatic metrics play a crucial role in machine translation. Despite the widespread use of n-gram-b...
Recent studies have revealed a number of pathologies of neural machine translation (NMT) systems. Hy...
Word sense disambiguation is a well-known source of translation errors in NMT. We posit that some of...
We present a Minimum Bayes Risk (MBR) decoder for statistical machine translation. The approach aims...
Translating text that diverges from the training domain is a key challenge for machine translation. ...
Solid evaluation of neural machine translation (NMT) is key to its understanding and improvement. Cu...
Automatic Machine Translation (MT) evaluation is an active field of research, with a handful of new ...
Recent research suggests that neural machine translation achieves parity with professional human tra...
We present Minimum Bayes-Risk (MBR) de-coding for statistical machine translation. This statistical ...
We present a comparison of automatic metrics against human evaluations of translation quality in sev...
This article reports a multifaceted comparison between statistical and neural machine translation (...
We present a novel scheme to combine neural machine translation (NMT) with traditional statistical m...