© 2013 Dr. Clinton BurfordInformation systems are transforming the ways in which people generate, store and share information. One consequence of this change is a massive increase in the quantity of digital content the average person needs to deal with. A large part of the information systems challenge is about finding intelligent ways to help users locate and analyse this information. One tool that is available to build systems to address this challenge is automatic document classification. A document classifier is a statistical model for predicting a label for an input document that is represented as a set of features. The potential usefulness of such a generalised system for categorising docum...
In the information age we much depend on our ability to find information hidden in mostly unstructur...
Automatic document classification and clustering are useful for a wide range of applications such as...
Many document classification applications require human understanding of the reasons for data-driven...
In this thesis we describe a method of using associative networks for automatic doc- ument grouping....
In general, document classification research focuses on the automated placement of unseen documents ...
A wide variety of text analysis applications are based on statistical machine learning techniques. T...
Automated document classification process extracts information with a systematic analysis of the con...
Numerous initiatives have allowed users to share knowledge or opinions using collaborative platforms...
Discovering relationships among concepts and categories is crucial in various information systems. T...
Because of the explosion of digital and online text information, automatic organization of documents...
Multi-label classification is a generalization of a broader concept of multi-class classification in...
We discuss here the search for inter-document references as an alternative to the grouping of docume...
This thesis explores multimodal document classification algorithms in a unified framework. Classific...
Traditional document clustering approaches are usually based on the Bag of Words model, which is lim...
Document classification and provenance has become an important area of computer science as the amoun...
In the information age we much depend on our ability to find information hidden in mostly unstructur...
Automatic document classification and clustering are useful for a wide range of applications such as...
Many document classification applications require human understanding of the reasons for data-driven...
In this thesis we describe a method of using associative networks for automatic doc- ument grouping....
In general, document classification research focuses on the automated placement of unseen documents ...
A wide variety of text analysis applications are based on statistical machine learning techniques. T...
Automated document classification process extracts information with a systematic analysis of the con...
Numerous initiatives have allowed users to share knowledge or opinions using collaborative platforms...
Discovering relationships among concepts and categories is crucial in various information systems. T...
Because of the explosion of digital and online text information, automatic organization of documents...
Multi-label classification is a generalization of a broader concept of multi-class classification in...
We discuss here the search for inter-document references as an alternative to the grouping of docume...
This thesis explores multimodal document classification algorithms in a unified framework. Classific...
Traditional document clustering approaches are usually based on the Bag of Words model, which is lim...
Document classification and provenance has become an important area of computer science as the amoun...
In the information age we much depend on our ability to find information hidden in mostly unstructur...
Automatic document classification and clustering are useful for a wide range of applications such as...
Many document classification applications require human understanding of the reasons for data-driven...