Web includes digital libraries and billions of text documents. A fast and simple search through this sizeable set is important for users and researchers. Since manual or rule based document classification is a difficult, time consuming process, automatic classification systems are absolutely needed. Automatic text classification systems demand extensive and proper training data sets. To provide these data sets, usually, numerous unlabeled documents are labeled manually by experts. Manual labeling of documents is a difficult and time consuming process. Moreover, in manual labeling, due to human exhaustion and carelessness, there is the possibility of mistakes. In this study, semi-automatic creation of training data set has been proposed in a...
With the explosive growth in the number of electronic documents available on the internet, intranets...
59 p.In this thesis, an algorithm is presented that selects samples of documents for training text c...
In text classification the amount and quality of training data is crucial for the performance of the...
This paper addresses the problem of semi-supervised classification on document collections using re...
This paper addresses the problem of semi-supervised classification on document collections using ret...
We describe work on automatically assigning labels to books using user-defined tags as the label set...
In many important text classification problems, acquiring class labels for training documents is cos...
Abstract. A major difficulty of supervised approaches for text classification is that they require a...
In text classification the amount and quality of training data is crucial for the performance of the...
In text classification the amount and quality of training data is crucial for the performance of the...
This paper addresses the problem of semi-supervised classification on document collections using ret...
In many machine learning problem domains large amounts of data are available but the cost of correct...
The aim of this thesis is to minimize manual work needed to create training data for text classifica...
One of the most pervasive challenges in adopting machine or deep learning is the scarcity of trainin...
Automatic text classification has a long history and many studies have been conducted in this field....
With the explosive growth in the number of electronic documents available on the internet, intranets...
59 p.In this thesis, an algorithm is presented that selects samples of documents for training text c...
In text classification the amount and quality of training data is crucial for the performance of the...
This paper addresses the problem of semi-supervised classification on document collections using re...
This paper addresses the problem of semi-supervised classification on document collections using ret...
We describe work on automatically assigning labels to books using user-defined tags as the label set...
In many important text classification problems, acquiring class labels for training documents is cos...
Abstract. A major difficulty of supervised approaches for text classification is that they require a...
In text classification the amount and quality of training data is crucial for the performance of the...
In text classification the amount and quality of training data is crucial for the performance of the...
This paper addresses the problem of semi-supervised classification on document collections using ret...
In many machine learning problem domains large amounts of data are available but the cost of correct...
The aim of this thesis is to minimize manual work needed to create training data for text classifica...
One of the most pervasive challenges in adopting machine or deep learning is the scarcity of trainin...
Automatic text classification has a long history and many studies have been conducted in this field....
With the explosive growth in the number of electronic documents available on the internet, intranets...
59 p.In this thesis, an algorithm is presented that selects samples of documents for training text c...
In text classification the amount and quality of training data is crucial for the performance of the...