A selective sampling algorithm is a learning algorithm for classification that, based on the past observed data, decides whether to sample the label of each new instance to be classified. In this paper we introduce a general technique for turning linear-threshold classification algorithms from the general additive family into randomized selective sampling algorithms. For the most popular algorithms in this family we derive mistake bounds that hold for individual sequences of examples. These bounds show that our semi-supervised algorithms can achieve, on average, the same accuracy as that of their fully supervised counterparts, but using much fewer labels. Our theoretical results are corroborated by a number of experiments on real-world t...
Heavy label noise is often present in many practical scenarios where observed labels of instances ar...
In this paper, we investigate how systemic errors due to random sampling impact on automated algorit...
This paper introduces a new method for learning algorithm evaluation and selection, with empirical r...
A selective sampling algorithm is a learning algorithm for classification that, based on the past o...
Online selective sampling algorithms learn to perform binary classification, and additionally they d...
We introduce efficient margin-based algorithms for selective sampling and filtering in binary classi...
Abstract—Traditional online learning algorithms are designed for vector data only, which assume that...
. Most existing inductive learning algorithms assume the availability of a training set of labeled ...
Abstract. Most existing inductive learning algorithms work under the assumption that their training ...
Most positive and unlabeled data is subject to selection biases. The labeled examples can, for examp...
Recently there has been much work on selec-tive sampling, an online active learning setting, in whic...
Selective sampling is an active variant of online learning in which the learner is allowed to adapti...
Valiant (1984) and others have studied the problem of learning vari-ous classes of Boolean functions...
Abstract. In many practical domains, misclassification costs can differ greatly and may be represent...
Learning to Rank is the task of learning a ranking function from a set of query-documents pairs. Gen...
Heavy label noise is often present in many practical scenarios where observed labels of instances ar...
In this paper, we investigate how systemic errors due to random sampling impact on automated algorit...
This paper introduces a new method for learning algorithm evaluation and selection, with empirical r...
A selective sampling algorithm is a learning algorithm for classification that, based on the past o...
Online selective sampling algorithms learn to perform binary classification, and additionally they d...
We introduce efficient margin-based algorithms for selective sampling and filtering in binary classi...
Abstract—Traditional online learning algorithms are designed for vector data only, which assume that...
. Most existing inductive learning algorithms assume the availability of a training set of labeled ...
Abstract. Most existing inductive learning algorithms work under the assumption that their training ...
Most positive and unlabeled data is subject to selection biases. The labeled examples can, for examp...
Recently there has been much work on selec-tive sampling, an online active learning setting, in whic...
Selective sampling is an active variant of online learning in which the learner is allowed to adapti...
Valiant (1984) and others have studied the problem of learning vari-ous classes of Boolean functions...
Abstract. In many practical domains, misclassification costs can differ greatly and may be represent...
Learning to Rank is the task of learning a ranking function from a set of query-documents pairs. Gen...
Heavy label noise is often present in many practical scenarios where observed labels of instances ar...
In this paper, we investigate how systemic errors due to random sampling impact on automated algorit...
This paper introduces a new method for learning algorithm evaluation and selection, with empirical r...