Crowdsourcing is widely used nowadays in machine learning for data labeling. Although in the traditional case annotators are asked to provide a single label for each instance, novel approaches allow annotators, in case of doubt, to choose a subset of labels as a way to extract more information from them. In both the traditional and these novel approaches, the reliability of the labelers can be modeled based on the collections of labels that they provide. In this paper, we propose an Expectation-Maximization-based method for crowdsourced data with candidate sets. Iteratively the likelihood of the parameters that model the reliability of the labelers is maximized, while the ground truth is estimated. The experimental results suggest that th...
Collecting labels for data is important for many practical applications (e.g., data mining). However...
The Dawid-Skene estimator has been widely used for inferring the true labels from the noisy labels p...
Collecting labels for data is important for many practical applications (e.g., data mining). However...
Crowdsourcing is widely used nowadays in machine learning for data labeling. Although in the traditi...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
With crowdsourcing systems, labels can be obtained with low cost, which facilitates the creation of ...
We propose novel algorithms for the problem of crowd- sourcing binary labels. Such binary labeling t...
Crowdsourcing has become an effective and popular tool for human-powered computation to label large ...
To deal with the low qualities of web workers in crowdsourcing, many unsupervised label aggregation ...
This paper presents an aggregation approach that learns a regression model from crowdsourced annotat...
This paper presents an aggregation approach that learns a regression model from crowdsourced annotat...
Over the last few years, deep learning has revolutionized the field of machine learning by dramatica...
Collecting labels for data is important for many practical applications (e.g., data mining). However...
The Dawid-Skene estimator has been widely used for inferring the true labels from the noisy labels p...
Collecting labels for data is important for many practical applications (e.g., data mining). However...
Crowdsourcing is widely used nowadays in machine learning for data labeling. Although in the traditi...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
With crowdsourcing systems, labels can be obtained with low cost, which facilitates the creation of ...
We propose novel algorithms for the problem of crowd- sourcing binary labels. Such binary labeling t...
Crowdsourcing has become an effective and popular tool for human-powered computation to label large ...
To deal with the low qualities of web workers in crowdsourcing, many unsupervised label aggregation ...
This paper presents an aggregation approach that learns a regression model from crowdsourced annotat...
This paper presents an aggregation approach that learns a regression model from crowdsourced annotat...
Over the last few years, deep learning has revolutionized the field of machine learning by dramatica...
Collecting labels for data is important for many practical applications (e.g., data mining). However...
The Dawid-Skene estimator has been widely used for inferring the true labels from the noisy labels p...
Collecting labels for data is important for many practical applications (e.g., data mining). However...