As the digital age pushes forward, data and document size have been increasing rapidly. A more efficient and accurate method of sampling data for training text classifiers is required. We require good samples and not just blind samples from Simple Random Sampling, therefore we experimented on a new proposed sampling algorithm – CONCISE. It is a novel sampling algorithm that is proposed for selecting training documents for text classification and experiments showed that it works particularly well with small sampling ratio. Experiments were conducted on the 20 Newsgroup corpus and Reuters 21578 document set using two classifiers SVM and Naïve Bayes classifier. CONCISE is compared with SRS in all experiments and results showed that CONCISE is ...
Text classification via supervised learning involves various steps from processing raw data, featur...
Abstract. Support vector machines (SVMs) have shown su-perb performance for text classification task...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
59 p.In this thesis, an algorithm is presented that selects samples of documents for training text c...
Abstract. In order to reduce human efforts, there has been increasing interest in applying active le...
This paper studies training set sampling strategies in the context of statistical learning for text ...
Abstract. A central problem in information retrieval is the automated classification of text documen...
Text classification is a powerful technique for automating assignment of documents to topic hierarch...
Text classification is the process in which text document is assigned to one or more predefined cate...
The Text mining and Data mining supports different kinds of algorithms for classification of large d...
Text classification involves deciding whether or not a document is about a given topic. It is an imp...
In this paper, we address the problem of dealing with a large collection of data and propose a met...
Under the era of technical surge in recent years, the weight of artificial intelligence in people\u2...
Text classification is used to classify the document of similar types . Text classification can be a...
This paper gives a comparison of frequently used classifier models for text classification in the re...
Text classification via supervised learning involves various steps from processing raw data, featur...
Abstract. Support vector machines (SVMs) have shown su-perb performance for text classification task...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
59 p.In this thesis, an algorithm is presented that selects samples of documents for training text c...
Abstract. In order to reduce human efforts, there has been increasing interest in applying active le...
This paper studies training set sampling strategies in the context of statistical learning for text ...
Abstract. A central problem in information retrieval is the automated classification of text documen...
Text classification is a powerful technique for automating assignment of documents to topic hierarch...
Text classification is the process in which text document is assigned to one or more predefined cate...
The Text mining and Data mining supports different kinds of algorithms for classification of large d...
Text classification involves deciding whether or not a document is about a given topic. It is an imp...
In this paper, we address the problem of dealing with a large collection of data and propose a met...
Under the era of technical surge in recent years, the weight of artificial intelligence in people\u2...
Text classification is used to classify the document of similar types . Text classification can be a...
This paper gives a comparison of frequently used classifier models for text classification in the re...
Text classification via supervised learning involves various steps from processing raw data, featur...
Abstract. Support vector machines (SVMs) have shown su-perb performance for text classification task...
Text categorization is the task of discovering the category or class text documents belongs to, or i...