International audienceMost existing active learning methods for classification, assume that the observed labels (i.e. given by a human labeller) are perfectly correct. However, in real world applications, the labeller is usually subject to labelling errors that reduce the classification accuracy of the learned model. In this paper, we address this issue for active learning in the streaming setting and we try to answer the following questions: (1) which labelled instances are most likely to be mislabelled? (2) is it always good to abstain from learning when data is suspected to be mislabelled? (3) which mislabelled instances require relabelling? We propose a hybrid active learning strategy based on two measures. The first measureallows to fi...
Conventional active learning algorithms assume a single labeler that produces noiseless label at a g...
Data labeling is an expensive and time-consuming task. Choosing which labels to use is increasingly ...
This thesis focuses on machine learning for data classification. To reduce the labelling cost, activ...
International audienceMost existing active learning methods for classification, assume that the obse...
International audienceMislabelling is a critical problem for stream-based active learning methods be...
Active learning methods have been proposed to reduce the labeling effort of human experts: based on ...
Abstract. We propose a new active learning framework where the ex-pert labeler is allowed to decline...
Active learning seeks to train the best classifier at the lowest annotation cost by intelligently pi...
In this position paper we introduce Active In-ference, a paradigm for intelligently request-ing huma...
In many settings in practice it is expensive to obtain labeled data while unlabeled data is abundant...
Active learning aims to label the most informative data points in order to minimize the cost of lab...
<p>Most classic machine learning methods depend on the assumption that humans can annotate all the d...
International audienceThis paper addresses stream-based active learning for classification. We propo...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
We study the problem of active learning in a stream-based setting, allowing the distribution of the ...
Conventional active learning algorithms assume a single labeler that produces noiseless label at a g...
Data labeling is an expensive and time-consuming task. Choosing which labels to use is increasingly ...
This thesis focuses on machine learning for data classification. To reduce the labelling cost, activ...
International audienceMost existing active learning methods for classification, assume that the obse...
International audienceMislabelling is a critical problem for stream-based active learning methods be...
Active learning methods have been proposed to reduce the labeling effort of human experts: based on ...
Abstract. We propose a new active learning framework where the ex-pert labeler is allowed to decline...
Active learning seeks to train the best classifier at the lowest annotation cost by intelligently pi...
In this position paper we introduce Active In-ference, a paradigm for intelligently request-ing huma...
In many settings in practice it is expensive to obtain labeled data while unlabeled data is abundant...
Active learning aims to label the most informative data points in order to minimize the cost of lab...
<p>Most classic machine learning methods depend on the assumption that humans can annotate all the d...
International audienceThis paper addresses stream-based active learning for classification. We propo...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
We study the problem of active learning in a stream-based setting, allowing the distribution of the ...
Conventional active learning algorithms assume a single labeler that produces noiseless label at a g...
Data labeling is an expensive and time-consuming task. Choosing which labels to use is increasingly ...
This thesis focuses on machine learning for data classification. To reduce the labelling cost, activ...