Real-world text classification tasks often suffer from poor class structure with many overlapping classes and blurred boundaries. Training data pooled from multiple sources tend to be inconsistent and contain erroneous labelling, leading to poor performance of standard text classifiers. The classification of health service products to specialized procurement classes is used to examine and quantify the extent of these problems. A novel method is presented to analyze the labelled data by selectively merging classes where there is not enough information for the classifier to distinguish them. Initial results show the method can identify the most problematic classes, which can be used either as a focus to improve the training data or to merge c...
Objective The US Vaccine Adverse Event Reporting System (VAERS) collects spontaneous reports of adve...
Pattern classification systems play an important role in medical decision support. They allow to aut...
Many real-world text classification tasks involve imbalanced training examples. The strategies propo...
Text classification is an active research area motivated by many real-world applications. Even so, r...
A common classification task of today is classifying resources that consist of words. Nondiscriminat...
Text Mining is the discovery of valuable, yet hidden, information from the text document. Text class...
Abstract- Naïve-Bayes and k-NN classifiers are two machine learning approaches for text classificati...
Most existing classification approaches assume the underlying training set is evenly distributed. In...
This paper gives a comparison of frequently used classifier models for text classification in the re...
Automatic text classification has a long history and many studies have been conducted in this litera...
Abstract: This paper analyzes confusion class phenomena existing in text classification procedure, a...
This paper investigates the problem of text classification. The task of text classification is to as...
We introduce the evolving label-set problem encountered in building real-world text classification s...
Automatic text classification has a long history and many studies have been conducted in this field....
Objective: To examine the feasibility of using statistical text classification to automatically iden...
Objective The US Vaccine Adverse Event Reporting System (VAERS) collects spontaneous reports of adve...
Pattern classification systems play an important role in medical decision support. They allow to aut...
Many real-world text classification tasks involve imbalanced training examples. The strategies propo...
Text classification is an active research area motivated by many real-world applications. Even so, r...
A common classification task of today is classifying resources that consist of words. Nondiscriminat...
Text Mining is the discovery of valuable, yet hidden, information from the text document. Text class...
Abstract- Naïve-Bayes and k-NN classifiers are two machine learning approaches for text classificati...
Most existing classification approaches assume the underlying training set is evenly distributed. In...
This paper gives a comparison of frequently used classifier models for text classification in the re...
Automatic text classification has a long history and many studies have been conducted in this litera...
Abstract: This paper analyzes confusion class phenomena existing in text classification procedure, a...
This paper investigates the problem of text classification. The task of text classification is to as...
We introduce the evolving label-set problem encountered in building real-world text classification s...
Automatic text classification has a long history and many studies have been conducted in this field....
Objective: To examine the feasibility of using statistical text classification to automatically iden...
Objective The US Vaccine Adverse Event Reporting System (VAERS) collects spontaneous reports of adve...
Pattern classification systems play an important role in medical decision support. They allow to aut...
Many real-world text classification tasks involve imbalanced training examples. The strategies propo...