International audienceLarge-scale annotated corpora have yielded impressive performance improvements in computer vision and multimedia content analysis. However, such datasets depend on an enormous amount of human labeling effort. When the labels correspond to well-known concepts, it is straightforward to train the annotators by giving a few examples with known answers. It is also straightforward to judge the quality of their labels. Neither is true when there are thousands of complex domain-specific labels. Training on all labels is infeasible and the quality of an annotator's judgements may be vastly different for some subsets of labels than for others. This paper proposes a set of data-driven algorithms to 1) train image annotators on ho...
This paper studies the active learning problem in crowdsourcing settings, where multiple imperfect a...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
ii With the proliferation of social media, gathering data has became cheaper and easier than before....
International audienceLarge-scale annotated corpora have yielded impressive performance improvements...
With the proliferation of social media, gathering data has became cheaper and easier than before. Ho...
Labeling large datasets has become faster, cheaper, and easier with the advent of crowdsourcing ser...
The success of deep learning in image recognition is substantially driven by large-scale, well-curat...
We introduce a method for efficiently crowdsourcing multiclass annotations in challenging, real worl...
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important m...
High-quality data is necessary for modern machine learning. However, the acquisition of such data is...
The creation of golden standard datasets is a costly business. Optimally more than one judgment per ...
Crowdsourcing platforms offer a practical solution to the problem of afford-ably annotating large da...
Labeled data is a prerequisite for successfully applying machine learning techniques to a wide range...
The supervised learning-based recommendation models, whose infrastructures are sufficient training s...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
This paper studies the active learning problem in crowdsourcing settings, where multiple imperfect a...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
ii With the proliferation of social media, gathering data has became cheaper and easier than before....
International audienceLarge-scale annotated corpora have yielded impressive performance improvements...
With the proliferation of social media, gathering data has became cheaper and easier than before. Ho...
Labeling large datasets has become faster, cheaper, and easier with the advent of crowdsourcing ser...
The success of deep learning in image recognition is substantially driven by large-scale, well-curat...
We introduce a method for efficiently crowdsourcing multiclass annotations in challenging, real worl...
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important m...
High-quality data is necessary for modern machine learning. However, the acquisition of such data is...
The creation of golden standard datasets is a costly business. Optimally more than one judgment per ...
Crowdsourcing platforms offer a practical solution to the problem of afford-ably annotating large da...
Labeled data is a prerequisite for successfully applying machine learning techniques to a wide range...
The supervised learning-based recommendation models, whose infrastructures are sufficient training s...
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a se...
This paper studies the active learning problem in crowdsourcing settings, where multiple imperfect a...
Machine learning applications can benefit greatly from vast amounts of data, provided that reliable ...
ii With the proliferation of social media, gathering data has became cheaper and easier than before....