Many document classification applications require human understanding of the reasons for data-driven classification decisions by managers, client-facing employees, and the technical team. Predictive models treat documents as data to be classified, and document data are characterized by very high dimensionality, often with tens of thousands to millions of variables (words). Unfortunately, due to the high dimensionality, understanding the decisions made by document classifiers is very difficult. This paper begins by extending the most relevant prior theoretical model of explanations for intelligent systems to account for some missing elements. The main theoretical contribution is the definition of a new sort of explanation as a minimal set of...
With the explosive growth of the volume and complexity of document data (e.g., news, blogs, web page...
Text documents can be described by a number of abstract concepts such as semantic category, writing ...
Given a written text in natural language, it is convenient to represent the information content of t...
Many document classification applications require human understanding of the reasons for data-driven...
This is a design-science paper about methods for explaining data-driven classifications of text docum...
Conventionally, document classification researches focus on improving the learning capabilities of c...
The purpose of this thesis has been to evaluate if a new instance based explanation method, called A...
Humans are remarkably adept at classifying text documents into cate-gories. For instance, while rea...
Classification of documents involves three distinct major pro-cesses. The first two processes of def...
A wide variety of text analysis applications are based on statistical machine learning techniques. T...
As the amount of data online grows, the urge to use this data for different applications grows as we...
Automatic text classification is the process of automatically classifying text documents into pre-de...
One limitation of most existing probabilistic latent topic models for document classification is tha...
This paper describes automatic document categorization based on large text hierarchy. We handle the...
We are living in the age of internet where massive amount of information is produced from various di...
With the explosive growth of the volume and complexity of document data (e.g., news, blogs, web page...
Text documents can be described by a number of abstract concepts such as semantic category, writing ...
Given a written text in natural language, it is convenient to represent the information content of t...
Many document classification applications require human understanding of the reasons for data-driven...
This is a design-science paper about methods for explaining data-driven classifications of text docum...
Conventionally, document classification researches focus on improving the learning capabilities of c...
The purpose of this thesis has been to evaluate if a new instance based explanation method, called A...
Humans are remarkably adept at classifying text documents into cate-gories. For instance, while rea...
Classification of documents involves three distinct major pro-cesses. The first two processes of def...
A wide variety of text analysis applications are based on statistical machine learning techniques. T...
As the amount of data online grows, the urge to use this data for different applications grows as we...
Automatic text classification is the process of automatically classifying text documents into pre-de...
One limitation of most existing probabilistic latent topic models for document classification is tha...
This paper describes automatic document categorization based on large text hierarchy. We handle the...
We are living in the age of internet where massive amount of information is produced from various di...
With the explosive growth of the volume and complexity of document data (e.g., news, blogs, web page...
Text documents can be described by a number of abstract concepts such as semantic category, writing ...
Given a written text in natural language, it is convenient to represent the information content of t...