Crowdsourcing has become an popular approach for annotating the large quantities of data required to train machine learning algorithms. However, obtaining labels in this manner poses two important challenges. First, naively labeling all of the data can be prohibitively expensive. Second, a significant fraction of the annotations can be incorrect due to carelessness or limited domain expertise of crowdsourced workers. Active learning provides a natural formulation to address the former issue by affordably selecting an appropriate subset of instances to label. Unfortunately, most active learning strategies are myopic and sensitive to label noise, which leads to poorly trained classifiers. We propose an active learning method that is specifica...
Automated activity recognition systems that use probabilistic models require labeled data sets in tr...
Conventional active learning algorithms assume a single labeler that produces noiseless label at a g...
<p>Most classic machine learning methods depend on the assumption that humans can annotate all the d...
Crowdsourcing has become an popular approach for annotating the large quantities of data required to...
Recognizing human activities from wearable sensor data is an important problem, particularly for hea...
With the proliferation of social media, gathering data has became cheaper and easier than before. Ho...
Active learning seeks to train the best classifier at the lowest annotation cost by intelligently pi...
Many machine learning datasets are noisy with a substantial number of mislabeled instances. This noi...
ii With the proliferation of social media, gathering data has became cheaper and easier than before....
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
Crowdsourcing platforms offer a practical solution to the problem of afford-ably annotating large da...
Training data creation is increasingly a key bottleneck for developing machine learning, especially ...
This paper studies the active learning problem in crowdsourcing settings, where multiple imperfect a...
This paper presents a practical method for pool-based active learning that is robust to annotation n...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
Automated activity recognition systems that use probabilistic models require labeled data sets in tr...
Conventional active learning algorithms assume a single labeler that produces noiseless label at a g...
<p>Most classic machine learning methods depend on the assumption that humans can annotate all the d...
Crowdsourcing has become an popular approach for annotating the large quantities of data required to...
Recognizing human activities from wearable sensor data is an important problem, particularly for hea...
With the proliferation of social media, gathering data has became cheaper and easier than before. Ho...
Active learning seeks to train the best classifier at the lowest annotation cost by intelligently pi...
Many machine learning datasets are noisy with a substantial number of mislabeled instances. This noi...
ii With the proliferation of social media, gathering data has became cheaper and easier than before....
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
Crowdsourcing platforms offer a practical solution to the problem of afford-ably annotating large da...
Training data creation is increasingly a key bottleneck for developing machine learning, especially ...
This paper studies the active learning problem in crowdsourcing settings, where multiple imperfect a...
This paper presents a practical method for pool-based active learning that is robust to annotation n...
Crowd-sourcing has become a popular means of acquiring labeled data for many tasks where humans are ...
Automated activity recognition systems that use probabilistic models require labeled data sets in tr...
Conventional active learning algorithms assume a single labeler that produces noiseless label at a g...
<p>Most classic machine learning methods depend on the assumption that humans can annotate all the d...