With crowdsourcing systems, labels can be obtained with low cost, which facilitates the creation of training sets for prediction model learning. However, the labels obtained from crowdsourcing are often imperfect, which brings great challenges in model learning. Since 2008, the machine learning community has noticed the great opportunities brought by crowdsourcing and has developed a large number of techniques to deal with inaccuracy, randomness, and uncertainty issues when learning with crowdsourcing. This paper summarizes the technical progress in this field during past eleven years. We focus on two fundamental issues: the data (label) quality and the prediction model quality. For data quality, we summarize ground truth inference methods ...
Biased labelers are a systemic problem in crowdsourcing, and a comprehensive toolbox for handling th...
Current quality control methods for crowdsourcing largely account for variations in worker responses...
With the proliferation of social media, gathering data has became cheaper and easier than before. Ho...
Although supervised learning requires a labeled dataset, obtaining labels from experts is generally ...
Although supervised learning requires a labeled dataset, ob- taining labels from experts is generall...
The advent of crowdsourcing has created a variety of new opportunities for improving upon traditiona...
Crowdsourcing is widely used nowadays in machine learning for data labeling. Although in the traditi...
The supervised learning-based recommendation models, whose infrastructures are sufficient training s...
Crowdsourcing services like Amazon’s Mechan-ical Turk have facilitated and greatly expedited the man...
We investigated the use of supervised learning methods that use labels from crowd workers to resolve...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
While crowdsourcing offers potential traction on data collection at scale, it also poses new and sig...
The internet enables us to collect and store unprecedented amounts of data. We need better models fo...
Bringing about models from great data sets and influential which subsets of data to mine is becoming...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Biased labelers are a systemic problem in crowdsourcing, and a comprehensive toolbox for handling th...
Current quality control methods for crowdsourcing largely account for variations in worker responses...
With the proliferation of social media, gathering data has became cheaper and easier than before. Ho...
Although supervised learning requires a labeled dataset, obtaining labels from experts is generally ...
Although supervised learning requires a labeled dataset, ob- taining labels from experts is generall...
The advent of crowdsourcing has created a variety of new opportunities for improving upon traditiona...
Crowdsourcing is widely used nowadays in machine learning for data labeling. Although in the traditi...
The supervised learning-based recommendation models, whose infrastructures are sufficient training s...
Crowdsourcing services like Amazon’s Mechan-ical Turk have facilitated and greatly expedited the man...
We investigated the use of supervised learning methods that use labels from crowd workers to resolve...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
While crowdsourcing offers potential traction on data collection at scale, it also poses new and sig...
The internet enables us to collect and store unprecedented amounts of data. We need better models fo...
Bringing about models from great data sets and influential which subsets of data to mine is becoming...
© 2019 Dr. Yuan LiThis thesis explores aggregation methods for crowdsourced annotations. Crowdsourci...
Biased labelers are a systemic problem in crowdsourcing, and a comprehensive toolbox for handling th...
Current quality control methods for crowdsourcing largely account for variations in worker responses...
With the proliferation of social media, gathering data has became cheaper and easier than before. Ho...