In some real-world applications, it is time-consuming or expensive to collect much labeled data, while unlabeled data is easier to obtain. Many semi-supervised learning methods have been proposed to deal with this problem by utilizing the unlabeled data. On the other hand, on some datasets, misclassifying different classes causes different costs, which challenges the common assumption in classification that classes have the same misclassification cost. For example, misclassifying a fraud as a legitimate transaction could be more serious than misclassifying a legitimate transaction as fraudulent. In this paper, we propose a cost-sensitive self-training method (CS-ST) to improve the performance of Naive Bayes when labeled instances are scarce...
This paper presents a new approach to identifying and eliminating mislabeled training instances for ...
In many important text classification problems, acquiring class labels for training documents is cos...
Abstract- The classifier built from a data set with a highly skewed class distribution generally pre...
It is an actual and challenging issue to learn cost-sensitive models from those datasets that are wi...
There is a significant body of research in machine learning addressing techniques for performing cla...
Abstract—Practical machine learning and data mining prob-lems often face shortage of labeled trainin...
We present a new machine learning framework called “self-taught learning ” for using unlabeled data ...
We present a new machine learning framework called “self-taught learning ” for using unlabeled data ...
In this paper, we study cost-sensitive semi-supervised learning where many of the training examples ...
In this paper, we study cost-sensitive semi-supervised learning where many of the training examples ...
Cost-sensitive classification is one of mainstream research topics in data mining and machine learni...
Many factors influence the performance of a learned classifier. In this paper we study different met...
One problem of data-driven answer extraction in open-domain factoid question answering is that the c...
. Many factors influence a learning process and the performance of a learned classifier. In this pap...
This paper addresses cost-sensitive classification in the setting where there are costs for measurin...
This paper presents a new approach to identifying and eliminating mislabeled training instances for ...
In many important text classification problems, acquiring class labels for training documents is cos...
Abstract- The classifier built from a data set with a highly skewed class distribution generally pre...
It is an actual and challenging issue to learn cost-sensitive models from those datasets that are wi...
There is a significant body of research in machine learning addressing techniques for performing cla...
Abstract—Practical machine learning and data mining prob-lems often face shortage of labeled trainin...
We present a new machine learning framework called “self-taught learning ” for using unlabeled data ...
We present a new machine learning framework called “self-taught learning ” for using unlabeled data ...
In this paper, we study cost-sensitive semi-supervised learning where many of the training examples ...
In this paper, we study cost-sensitive semi-supervised learning where many of the training examples ...
Cost-sensitive classification is one of mainstream research topics in data mining and machine learni...
Many factors influence the performance of a learned classifier. In this paper we study different met...
One problem of data-driven answer extraction in open-domain factoid question answering is that the c...
. Many factors influence a learning process and the performance of a learned classifier. In this pap...
This paper addresses cost-sensitive classification in the setting where there are costs for measurin...
This paper presents a new approach to identifying and eliminating mislabeled training instances for ...
In many important text classification problems, acquiring class labels for training documents is cos...
Abstract- The classifier built from a data set with a highly skewed class distribution generally pre...