For annotation tasks involving independent judgments, probabilistic models have been used to infer ground truth labels from data where a crowd of many annotators labels the same items. Such models have been shown to produce results superior to taking the majority vote, but have not been applied to sequential data. We present two methods to infer ground truth labels from sequential annotations where we assume judgments are not independent, based on the observation that an annotator’s segments all tend to be several utterances long. The data consists of crowd labels for anno-tation of discourse segment boundaries. The new methods extend Hidden Markov Models to relax the independence assumption. The two methods are distinct, so positive labels...
International audienceDiscourse segmentation, the first step of discourse analysis, has been shown t...
Crowdsourcing has revolutionised the way tasks can be completed but the process is frequently ineffi...
So far, predictions of user quality judgments in response to spoken dialog systems have been achieve...
The need to model the relation between discourse structure and linguistic features of ut-terances is...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
Data annotation in modern practice often involves multiple, imperfect human annotators. Multiple ann...
Crowd-sourcing is a cheap and popular means of creating training and evaluation datasets for machine...
The analysis of crowdsourced annotations in natural language processing is concerned with identifyin...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
Crowd sequential annotations can be an efficient and cost-effective way to build large datasets for ...
In NLP annotation, it is common to have multiple annotators label the text and then obtain the groun...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Incorporating annotators' knowledge into a machine-learning framework for detecting psychological tr...
This paper reports an ongoing effort to derive linear discourse structures from a corpus of telephon...
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important m...
International audienceDiscourse segmentation, the first step of discourse analysis, has been shown t...
Crowdsourcing has revolutionised the way tasks can be completed but the process is frequently ineffi...
So far, predictions of user quality judgments in response to spoken dialog systems have been achieve...
The need to model the relation between discourse structure and linguistic features of ut-terances is...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
Data annotation in modern practice often involves multiple, imperfect human annotators. Multiple ann...
Crowd-sourcing is a cheap and popular means of creating training and evaluation datasets for machine...
The analysis of crowdsourced annotations in natural language processing is concerned with identifyin...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
Crowd sequential annotations can be an efficient and cost-effective way to build large datasets for ...
In NLP annotation, it is common to have multiple annotators label the text and then obtain the groun...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Incorporating annotators' knowledge into a machine-learning framework for detecting psychological tr...
This paper reports an ongoing effort to derive linear discourse structures from a corpus of telephon...
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important m...
International audienceDiscourse segmentation, the first step of discourse analysis, has been shown t...
Crowdsourcing has revolutionised the way tasks can be completed but the process is frequently ineffi...
So far, predictions of user quality judgments in response to spoken dialog systems have been achieve...