Abstract: We present a first analysis of inter-annotator agreement for the DIT++ tagset of dialogue acts, a comprehensive, layered, multidimensional set of 86 tags. Within a dimension or a layer, subsets of tags are often hierarchically organised. We argue that, especially for such highly structured annotation schemes, the well-known kappa statistic is not an adequate measure of inter-annotator agreement. Instead, we propose a statistic that takes the structural properties of the tagset into account, and we discuss the application of this statistic in an annotation experiment. The experiment shows promising agreement scores for most dimensions in the tagset and provides useful insights into the usability of the annotation scheme, but also indi...
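To illustrate why plain kappa can be too strict for a hierarchical tagset, the following minimal sketch compares standard (nominal) kappa with a distance-weighted variant of kappa. It is not the statistic proposed in the paper: the toy hierarchy `PARENT`, the tag names, and the 0 / 0.5 / 1 disagreement weights in `distance` are all hypothetical choices made only for this example.

```python
from collections import Counter
from itertools import product

# Hypothetical toy hierarchy (child -> parent); NOT the DIT++ taxonomy.
PARENT = {
    "question": None,
    "yn-question": "question",
    "wh-question": "question",
    "inform": None,
    "answer": "inform",
}


def ancestors(tag):
    """Return the path from a tag up to its root, the tag itself included."""
    path = []
    while tag is not None:
        path.append(tag)
        tag = PARENT[tag]
    return path


def nominal(a, b):
    """Plain kappa: any two different tags disagree completely."""
    return 0.0 if a == b else 1.0


def distance(a, b):
    """Assumed weights: 0 if identical, 0.5 if one tag is an ancestor
    of the other, 1 otherwise."""
    if a == b:
        return 0.0
    if a in ancestors(b) or b in ancestors(a):
        return 0.5
    return 1.0


def weighted_kappa(labels_a, labels_b, dist):
    """1 - observed disagreement / chance-expected disagreement.

    With dist=nominal this reduces to Cohen's kappa for two annotators.
    """
    n = len(labels_a)
    observed = sum(dist(a, b) for a, b in zip(labels_a, labels_b)) / n
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(
        freq_a[a] * freq_b[b] * dist(a, b) for a, b in product(freq_a, freq_b)
    ) / (n * n)
    if expected == 0:  # both annotators always used the same single tag
        return 1.0
    return 1.0 - observed / expected


annotator_1 = ["question", "yn-question", "inform", "answer"]
annotator_2 = ["yn-question", "yn-question", "inform", "inform"]

plain = weighted_kappa(annotator_1, annotator_2, nominal)
structured = weighted_kappa(annotator_1, annotator_2, distance)
print(f"nominal kappa:  {plain:.3f}")
print(f"weighted kappa: {structured:.3f}")
```

In this toy data both disagreements are between a tag and its direct parent, so the weighted score is higher than the nominal one. How partial agreement should actually be weighted is a design decision that depends on the tagset.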
This paper describes a system for inter-annotator agreement analysis of ERE annotation, focusing on...
This paper describes the evaluation of a coding scheme for the segmentation and annotation of transl...
We report an annotation experiment aiming at assessing the use of a single functional taxonomy of se...
In this paper the dialogue act annotation of naive and expert annotators, both annotating the same d...
We present the Conversation Analysis Modeling Schema (CAMS), a novel dialogue labeling schema that c...
In this abstract we present a methodology to improve Argument annotation guide...
In natural language processing (NLP) annotation projects, we use inter-annotator agreement measures...
In recent years, the coefficient of agreement has become the de facto standard to evaluate intercode...
Building reference corpora makes it necessary to align annotations and to measure a...
Agreement measures have been widely used in Computational Linguistics for more...
The usual practice in assessing whether a multimodal annotated corpus is fit for purpose is to calcu...
Linguistic annotation underlies many successful approaches in Natural Language...
In the TDB-like corpora annotation efforts, which are constructed by the intuitions of the annota...
Reference annotated (or gold-standard) datasets are required for various commo...
Wlodarczak M. Ranked multidimensional dialogue act annotation. In: Slavkovik M, ed. Proceedings of t...