International audienceReference annotated (or gold-standard) datasets are required for various common tasks such as training for machine learning systems or system validation. They are necessary to analyse or compare occurrences or items annotated by experts, or to compare objects resulting from any computational process to objects annotated (selected and characterized) by experts. But, even if reference annotated gold-standard corpora are required, their production is known as a difficult problem, from both a theoretical and practical point of view. Many studies devoted to these issues conclude that multi-annotation is most of the time a necessity. Measuring the inter-annotator agreement, which is required to check the reliability of data ...
The creation of golden standard datasets is a costly business. Optimally more than one judgment per ...
National audienceA lot of data is produced by NLP (automatic systems) and for NLP (reference corpus,...
International audienceAgreement measures have been widely used in Computational Linguistics for more...
Computing inter-annotator agreement measures on a manually annotated corpus is necessary to evaluate...
The usual practice in assessing whether a multimodal annotated corpus is fit for purpose is to calcu...
Researchers who make use of multimodal annotated corpora are always presented with something of a di...
Annotation projects dealing with complex semantic or pragmatic phenomena face the dilemma of creatin...
International audienceIn this abstract we present a methodology to improve Argument annotation guide...
Semantic annotation tasks contain ambiguity and vagueness and require varying degrees of world knowl...
International audienceInter-coders agreement measures are used to assess the reliability of annotate...
In the TDB[1]-like corpora annotation efforts, which are constructed by the intuitions of the annota...
National audienceBuilding reference corpora makes it necessary to align annotations and to measure a...
Many interesting phenomena in conversation can only be annotated as a subjective task, requiring int...
Standard agreement measures for interannota-tor reliability are neither necessary nor suffi-cient to...
This is the accompanying data for the paper "Analyzing Dataset Annotation Quality Management in the...
The creation of golden standard datasets is a costly business. Optimally more than one judgment per ...
National audienceA lot of data is produced by NLP (automatic systems) and for NLP (reference corpus,...
International audienceAgreement measures have been widely used in Computational Linguistics for more...
Computing inter-annotator agreement measures on a manually annotated corpus is necessary to evaluate...
The usual practice in assessing whether a multimodal annotated corpus is fit for purpose is to calcu...
Researchers who make use of multimodal annotated corpora are always presented with something of a di...
Annotation projects dealing with complex semantic or pragmatic phenomena face the dilemma of creatin...
International audienceIn this abstract we present a methodology to improve Argument annotation guide...
Semantic annotation tasks contain ambiguity and vagueness and require varying degrees of world knowl...
International audienceInter-coders agreement measures are used to assess the reliability of annotate...
In the TDB[1]-like corpora annotation efforts, which are constructed by the intuitions of the annota...
National audienceBuilding reference corpora makes it necessary to align annotations and to measure a...
Many interesting phenomena in conversation can only be annotated as a subjective task, requiring int...
Standard agreement measures for interannota-tor reliability are neither necessary nor suffi-cient to...
This is the accompanying data for the paper "Analyzing Dataset Annotation Quality Management in the...
The creation of golden standard datasets is a costly business. Optimally more than one judgment per ...
National audienceA lot of data is produced by NLP (automatic systems) and for NLP (reference corpus,...
International audienceAgreement measures have been widely used in Computational Linguistics for more...