We describe the National Research Council's (NRC) entry in the Anomaly Detection/Text Mining competition organized at the Text Mining Workshop 2007. This entry relies on a straightforward implementation of a probabilistic categorizer described earlier [GGPC02]. This categorizer is adapted to handle multiple labeling and a piecewise-linear confidence estimation layer is added to provide an estimate of the labeling confidence. This technique achieves a score of 1.689 on the test data. This model has potentially useful features and extensions such as the use of a category-specific decision layer or the extraction of descriptive category keywords from the probabilistic profile.NRC publication: Ye
This paper experimentally reports on Bayesian predictive uncertainty for real-world text classificat...
The automatic categorisation of web documents is be-coming crucial for organising the huge amount of...
We present an approach to text categorization using machine learning techniques. The approach is dev...
We describe NRC's submission to the Anomaly Detection/Text Mining competition organised at the Text ...
We describe NRC's submission to the Anomaly Detection/Text Mining competition organised at the Text ...
Text categorization is the classification of documents with respect to a set of predefined categorie...
In \ac{ATC}, a general inductive process automatically builds a classifier for the categories involv...
This paper presents the probabilistic model named TwodimensionalProbabilistic Model (2DPM). In this ...
Text data mining is the process of extracting and analyzing valuable information from text. A text d...
Kilimci, Zeynep Hilal (Dogus Author) -- Conference full title: IEEE International Symposium on INnov...
Bulgarian National Science Fund;Bulgarian Section2019 IEEE International Symposium on INnovations in...
Text classification is the task of assigning predefined categories to free text documents. Due to th...
Abstract. In this paper, we propose a probabilistic approach to fea-ture selection for multi-class t...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
This paper experimentally reports on Bayesian predictive uncertainty for real-world text classificat...
The automatic categorisation of web documents is be-coming crucial for organising the huge amount of...
We present an approach to text categorization using machine learning techniques. The approach is dev...
We describe NRC's submission to the Anomaly Detection/Text Mining competition organised at the Text ...
We describe NRC's submission to the Anomaly Detection/Text Mining competition organised at the Text ...
Text categorization is the classification of documents with respect to a set of predefined categorie...
In \ac{ATC}, a general inductive process automatically builds a classifier for the categories involv...
This paper presents the probabilistic model named TwodimensionalProbabilistic Model (2DPM). In this ...
Text data mining is the process of extracting and analyzing valuable information from text. A text d...
Kilimci, Zeynep Hilal (Dogus Author) -- Conference full title: IEEE International Symposium on INnov...
Bulgarian National Science Fund;Bulgarian Section2019 IEEE International Symposium on INnovations in...
Text classification is the task of assigning predefined categories to free text documents. Due to th...
Abstract. In this paper, we propose a probabilistic approach to fea-ture selection for multi-class t...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
This paper experimentally reports on Bayesian predictive uncertainty for real-world text classificat...
The automatic categorisation of web documents is be-coming crucial for organising the huge amount of...
We present an approach to text categorization using machine learning techniques. The approach is dev...