Many document classification applications require human understanding of the reasons for data-driven classification decisions: by managers, client-facing employees, and the technical team. Predictive models treat documents as data to be classified, and document data are characterized by very high dimensionality, often with tens of thousands to millions of variables (words). Unfortunately, due to the high dimensionality, understanding the decisions made by document classifiers is very difficult. This paper begins by extending the most relevant prior theoretical model of explanations for intelligent systems to account for some missing elements. The main theoretical contribution of the work is the definition of a new sort of explanation as a m...
With the explosive growth of the volume and complexity of document data (e.g., news, blogs, web page...
Machine Learning approach to text classification has been the dominant method in the research and ap...
US corporations regularly spend millions of dollars reviewing electronically-stored documents in leg...
Many document classification applications require human understanding of the reasons for data-driven...
This is a design-science paper about methods for explaining data-driven classifications of text docum...
Humans are remarkably adept at classifying text documents into cate-gories. For instance, while rea...
Conventionally, document classification researches focus on improving the learning capabilities of c...
Text classification is the undertaking of naturally sorting an arrangement of archives into classifi...
We are living in the age of internet where massive amount of information is produced from various di...
The purpose of this thesis has been to evaluate if a new instance based explanation method, called A...
As the amount of data online grows, the urge to use this data for different applications grows as we...
This thesis explores a new approach to automatic characterisation of business documents of different...
In this paper we presented a lot of experiments that examine how the particular parts of the documen...
Effective decision making is based on accurate and timely information. However, human decision maker...
Classification of documents involves three distinct major pro-cesses. The first two processes of def...
With the explosive growth of the volume and complexity of document data (e.g., news, blogs, web page...
Machine Learning approach to text classification has been the dominant method in the research and ap...
US corporations regularly spend millions of dollars reviewing electronically-stored documents in leg...
Many document classification applications require human understanding of the reasons for data-driven...
This is a design-science paper about methods for explaining data-driven classifications of text docum...
Humans are remarkably adept at classifying text documents into cate-gories. For instance, while rea...
Conventionally, document classification researches focus on improving the learning capabilities of c...
Text classification is the undertaking of naturally sorting an arrangement of archives into classifi...
We are living in the age of internet where massive amount of information is produced from various di...
The purpose of this thesis has been to evaluate if a new instance based explanation method, called A...
As the amount of data online grows, the urge to use this data for different applications grows as we...
This thesis explores a new approach to automatic characterisation of business documents of different...
In this paper we presented a lot of experiments that examine how the particular parts of the documen...
Effective decision making is based on accurate and timely information. However, human decision maker...
Classification of documents involves three distinct major pro-cesses. The first two processes of def...
With the explosive growth of the volume and complexity of document data (e.g., news, blogs, web page...
Machine Learning approach to text classification has been the dominant method in the research and ap...
US corporations regularly spend millions of dollars reviewing electronically-stored documents in leg...