In natural language processing (NLP) annotation projects, we use inter-annotator agreement measures and annotation guidelines to ensure consistent annotations. However, annotation guidelines often make linguistically debatable and even somewhat arbitrary decisions, and inter-annotator agreement is often less than perfect. While annotation projects usually specify how to deal with linguistically debatable phenomena, annotator disagreements typically still stem from these “hard” cases. This indicates that some errors are more debatable than others. In this paper, we use small samples of doubly-annotated part-of-speech (POS) data for Twitter to estimate annotation reliability and show how those metrics of likely inter-annotator agreement...
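The abstract above estimates annotation reliability from small doubly-annotated samples. The standard agreement metric behind such estimates, Cohen's kappa, can be sketched in plain Python; the tag sequences below are purely hypothetical toy data, not from the paper:

```python
from collections import Counter

def cohens_kappa(ann_a, ann_b):
    """Cohen's kappa for two annotators' label sequences of equal length."""
    assert len(ann_a) == len(ann_b) and ann_a
    n = len(ann_a)
    # Observed agreement: fraction of items where both annotators agree.
    observed = sum(a == b for a, b in zip(ann_a, ann_b)) / n
    # Expected agreement by chance, from each annotator's label distribution.
    freq_a, freq_b = Counter(ann_a), Counter(ann_b)
    expected = sum(freq_a[lab] * freq_b[lab] for lab in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)

# Toy doubly-annotated POS tags (hypothetical example).
a = ["NOUN", "VERB", "NOUN", "ADP", "NOUN", "VERB"]
b = ["NOUN", "VERB", "ADJ",  "ADP", "NOUN", "NOUN"]
print(round(cohens_kappa(a, b), 3))  # → 0.5
```

Raw (observed) agreement here is 4/6, but kappa discounts the agreement expected from the two annotators' tag distributions alone, which is why it is the preferred reliability estimate.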
This article presents our work on constructing a corpus of news articles in which events are annotat...
We commonly use agreement measures to assess the utility of judgements made by human annotators in N...
We present a first analysis of inter-annotator agreement for the DIT++ tagset of dialogue a...
Linguistic annotation underlies many successful approaches in Natural Language...
In this abstract we present a methodology to improve Argument annotation guide...
Standard agreement measures for interannotator reliability are neither necessary nor sufficient to...
Corpus linguistic and language technological research needs empirical corpus data with nearly correc...
Several works in Natural Language Processing have recently looked into part-of-speech (POS) annotati...
In NLP annotation, it is common to have multiple annotators label the text and then obtain the groun...
For one aspect of grammatical annotation, part-of-speech tagging, we investigate experimentally whet...
Computing inter-annotator agreement measures on a manually annotated corpus is necessary to evaluate...
© 2005 Andrew MacKinlay. In natural language processing (NLP), a crucial subsystem in a wide range of ...
This article details a series of carefully designed experiments aiming at evaluating the influence ...
Since state-of-the-art approaches to offensive language detection rely on supervised learning, it is...