Data annotation in modern practice often involves multiple, imperfect human annotators. Multiple annotations can be used to infer estimates of the ground-truth labels and to estimate individual annotator error characteristics (or reliability). We introduce MOMRESP, a model that improves upon item response models to incorporate information from both natural data clusters as well as annotations from multiple annotators to infer ground-truth labels for the document classification task. We implement this model and show that MOMRESP can use unlabeled data to improve estimates of the ground-truth labels over a majority vote baseline dramatically in situations where both annotations are scarce and annotation quality is low as well as in situations...
Supervised learning from multiple labeling sources is an increasingly important problem in machine l...
For annotation tasks involving independent judgments, probabilistic models have been used to infer g...
This paper describes a crowdsourcing system that integrates machine learning techniques with hu-man ...
The analysis of crowdsourced annotations in natural language processing is concerned with identifyin...
The analysis of crowdsourced annotations in NLP is concerned with identifying 1) gold standard label...
With the increasing popularity of online crowdsourcing platforms such as Amazon Mechanical Turk (AMT...
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important m...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
International audienceLarge-scale annotated corpora have yielded impressive performance improvements...
a b s t r a c t With the increasing popularity of online crowdsourcing platforms such as Amazon Mech...
This paper addresses the challenging problem of learning from multiple annotators whose labeling acc...
Abstract. Supervised learning from multiple annotators is an increasingly im-portant problem in mach...
Supervised learning from multiple labeling sources is an increasingly important problem in machine l...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Large-scale annotation efforts typically involve several experts who may disagree with each other. W...
Supervised learning from multiple labeling sources is an increasingly important problem in machine l...
For annotation tasks involving independent judgments, probabilistic models have been used to infer g...
This paper describes a crowdsourcing system that integrates machine learning techniques with hu-man ...
The analysis of crowdsourced annotations in natural language processing is concerned with identifyin...
The analysis of crowdsourced annotations in NLP is concerned with identifying 1) gold standard label...
With the increasing popularity of online crowdsourcing platforms such as Amazon Mechanical Turk (AMT...
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important m...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
International audienceLarge-scale annotated corpora have yielded impressive performance improvements...
a b s t r a c t With the increasing popularity of online crowdsourcing platforms such as Amazon Mech...
This paper addresses the challenging problem of learning from multiple annotators whose labeling acc...
Abstract. Supervised learning from multiple annotators is an increasingly im-portant problem in mach...
Supervised learning from multiple labeling sources is an increasingly important problem in machine l...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Large-scale annotation efforts typically involve several experts who may disagree with each other. W...
Supervised learning from multiple labeling sources is an increasingly important problem in machine l...
For annotation tasks involving independent judgments, probabilistic models have been used to infer g...
This paper describes a crowdsourcing system that integrates machine learning techniques with hu-man ...