Agree to disagree: analysis of inter-annotator disagreements in human evaluation of machine translation output

Popović, Maja

Publication date

November 2021

DOI

10.18653/v1/2021.conll-1.18

Publisher

Association for Computational Linguistics (ACL)

Abstract

This work describes an analysis of inter-annotator disagreements in human evaluation of machine translation output. The errors in the analysed texts were marked by multiple annotators under the guidance of different quality criteria: adequacy, comprehension, and an unspecified generic mixture of adequacy and fluency. Our results show that different criteria result in different disagreements, and indicate that a clear definition of the quality criterion can improve inter-annotator agreement. Furthermore, our results show that for certain linguistic phenomena which are not limited to one or two words (such as word ambiguity or gender) but span several words or even entire phrases (such as negation or relative clause), disagreements do not ne...
