This paper aims to provide a reliable method for measuring the correlations between the scores of different evaluation metrics applied to machine-translated texts. A series of examples from recent MT evaluation experiments is first discussed, including results and data from the recent French MT evaluation campaign, CESTA, which is used here. To compute correlation, a set of 1,500 samples for each system and each evaluation metric is created using bootstrapping. Correlations between metrics, both automatic and applied by human judges, are then computed over these samples. The results confirm the previously observed correlations between some automatic metrics, but also indicate a lack of correlation between human and automatic metrics on the ...
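The bootstrapping step described above can be sketched briefly. The snippet below is a minimal illustration, not the paper's implementation: it assumes per-segment scores are available for each metric, resamples the test set with replacement 1,500 times, scores the same resampled segments under both metrics, and correlates the resulting score pairs. All names and data (`paired_bootstrap`, `auto_metric`, `human_judge`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(seed=0)

def paired_bootstrap(scores_a, scores_b, n_samples=1500):
    """Draw n_samples bootstrap resamples of the test set and score the
    SAME resampled segments under both metrics, so the resulting pairs
    of corpus-level scores can be correlated."""
    scores_a = np.asarray(scores_a)
    scores_b = np.asarray(scores_b)
    n = len(scores_a)
    idx = rng.integers(0, n, size=(n_samples, n))  # same segments for both
    return scores_a[idx].mean(axis=1), scores_b[idx].mean(axis=1)

# Hypothetical per-segment scores for one system: an automatic metric and
# a human judgment, loosely coupled through a shared quality signal.
quality = rng.uniform(0.0, 1.0, size=400)
auto_metric = quality + rng.normal(0.0, 0.15, size=400)
human_judge = quality + rng.normal(0.0, 0.30, size=400)

a, b = paired_bootstrap(auto_metric, human_judge)
r = np.corrcoef(a, b)[0, 1]  # Pearson correlation over the 1,500 samples
print(f"bootstrap correlation: {r:.3f}")
```

Resampling indices once and applying them to both metrics is what makes the correlation meaningful; independently resampling each metric would yield score vectors that are uncorrelated by construction.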
This paper presents the results of the WMT17 Metrics Shared Task. We asked participants of this task...
Evaluation measures for machine translation depend on several common methods, such as preprocessin...
We propose three new features for MT evaluation: source-sentence constrained n-gram precision, sourc...
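The constrained variant named in this truncated abstract cannot be recovered from the snippet, but the plain n-gram precision it restricts is standard BLEU machinery. As context only, a minimal sketch (function name and example sentences are hypothetical):

```python
from collections import Counter

def ngram_precision(candidate, reference, n=2):
    """Clipped n-gram precision, the BLEU building block that a
    'source-sentence constrained' variant would restrict further:
    the fraction of candidate n-grams also found in the reference,
    with each n-gram's count clipped at its reference count."""
    def ngrams(tokens):
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    cand, ref = ngrams(candidate.split()), ngrams(reference.split())
    overlap = sum(min(c, ref[g]) for g, c in cand.items())
    total = sum(cand.values())
    return overlap / total if total else 0.0

# 3 of 5 candidate bigrams appear in the reference -> 0.6
print(ngram_precision("the cat sat on the mat", "the cat is on the mat"))
```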
Automatic evaluation metrics are fast and cost-effective measurements of the quality of a Machine Tr...
Automatic Machine Translation metrics, such as BLEU, are widely used in empirical evaluation as a su...
Automatic Machine Translation (MT) evaluation is an active field of research, with a handful of new ...
This paper applies nonparametric statistical techniques to Machine Translation (MT) Evaluation usin...
Evaluating the output quality of a machine translation system requires test data and quality metrics t...
We introduce BLANC, a family of dynamic, trainable evaluation metrics for machine translation. Flexi...
State-of-the-art MT systems use a so-called log-linear model, which combines several components to pre...
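For reference, the standard log-linear formulation this abstract alludes to (as in Och and Ney, 2002) scores a candidate translation as a weighted sum of feature functions; the notation below ($e$ for the translation, $f$ for the source sentence, $h_i$ and $\lambda_i$ for features and weights) is conventional, not taken from the truncated abstract:

```latex
\documentclass{article}
\usepackage{amsmath}
\begin{document}
% Log-linear model: the decoder picks the translation $\hat{e}$ of source
% sentence $f$ that maximizes a weighted sum of $M$ feature functions
% $h_i$ (translation model, language model, ...) with weights $\lambda_i$
% tuned on development data.
\[
  \hat{e} \;=\; \operatorname*{arg\,max}_{e} \sum_{i=1}^{M} \lambda_i \, h_i(e, f)
\]
\end{document}
```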
Automatic evaluation metrics for Machine Translation (MT) systems, such as BLEU and the related NIST...