We study the problem of semantically annotating textual documents that are complex in the sense that the documents are long, feature rich, and domain specific. Due to their complexity, such annotation tasks require trained human workers, which are very expensive in both time and money. We propose CEMA, a method for deploying machine learning to assist humans in complex document annotation. CEMA estimates the human cost of annotating each document and selects the set of documents to be annotated that strike the best balance between model accuracy and human cost. We conduct experiments on complex annotation tasks in which we compare CEMA against other document selection and annotation strategies. Our results show that CEMA is the most cost-ef...
We present a novel approach to the selective annotation of large corpora through the use of machine ...
Typically, accuracy is used to represent the performance of an NLP system. However, accuracy attainm...
The CASAM multimedia annotation system implements a model of cooperative annotation between a human ...
Manual corpus annotation is getting widely used in Natural Language Processing (NLP). While being re...
We introduce a framework for automated semantic document annotation that is composed of four process...
The annotation of texts and other material in the field of digital humanities and Natural Language P...
Over the last decade, the state-of-the-art in text mining has moved towards the adoption of machine ...
Creating high-quality manual annotations on text corpus is time-consuming and often requires the wor...
Prior work has found that classifier accuracy can be improved early in the process by having each an...
Natural language processing (NLP) summarisers aim to capture the essential elements of a document. Y...
Many complex discourse-level tasks can aid domain experts in their work but require costly expert an...
The traditional process of document annotation for knowledge identification and extraction in the Se...
Annotation is a fundamental activity for information extraction. Annotation of large corpora provid...
Holter OM, Ell B. Human-Machine Collaborative Annotation: A Case Study with GPT-3. Presented at the ...
Arbeit an der Bibliothek noch nicht eingelangt - Daten nicht geprüftAbweichender Titel nach Übersetz...
We present a novel approach to the selective annotation of large corpora through the use of machine ...
Typically, accuracy is used to represent the performance of an NLP system. However, accuracy attainm...
The CASAM multimedia annotation system implements a model of cooperative annotation between a human ...
Manual corpus annotation is getting widely used in Natural Language Processing (NLP). While being re...
We introduce a framework for automated semantic document annotation that is composed of four process...
The annotation of texts and other material in the field of digital humanities and Natural Language P...
Over the last decade, the state-of-the-art in text mining has moved towards the adoption of machine ...
Creating high-quality manual annotations on text corpus is time-consuming and often requires the wor...
Prior work has found that classifier accuracy can be improved early in the process by having each an...
Natural language processing (NLP) summarisers aim to capture the essential elements of a document. Y...
Many complex discourse-level tasks can aid domain experts in their work but require costly expert an...
The traditional process of document annotation for knowledge identification and extraction in the Se...
Annotation is a fundamental activity for information extraction. Annotation of large corpora provid...
Holter OM, Ell B. Human-Machine Collaborative Annotation: A Case Study with GPT-3. Presented at the ...
Arbeit an der Bibliothek noch nicht eingelangt - Daten nicht geprüftAbweichender Titel nach Übersetz...
We present a novel approach to the selective annotation of large corpora through the use of machine ...
Typically, accuracy is used to represent the performance of an NLP system. However, accuracy attainm...
The CASAM multimedia annotation system implements a model of cooperative annotation between a human ...