While automated methods for information organization have been around for several decades now, exponential growth of the World Wide Web has put them into the forefront of research in different communities, within which several approaches can be identified: 1) machine learning (algorithms that allow computers to improve their performance based on learning from pre-existing data); 2) document clustering (algorithms for unsupervised document organization and automated topic extraction); and 3) string matching (algorithms that match given strings within larger text). Here the aim was to automatically organize textual documents into hierarchical structures for subject browsing. The string-matching approach was tested using a controlled vocabular...
This paper describes the usage of machine learning techniques to assign keywords to documents. The l...
As the dramatic expansion of online publications continues, state libraries urgently need effective ...
Abstract. In this paper, the problem of classifying a HTML documents into a hierarchy of categories ...
While automated methods for information organization have been around for several decades now, expon...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
With the exponential growth of the World Wide Web, automated subject classification of Web pages has...
Purpose– To provide an integrated perspective to similarities and differences between approaches to ...
Purpose - To provide an integrated perspective to similarities and differences between approaches to...
The primary objective of this study was to identify and address problems of applying a controlled vo...
Most of the research on text categorization has focused on classifying text documents into a set of ...
A machine-learning and a string-matching approach to automated subject classification of text were c...
International audienceAutomated subject classification has been a challenging research issue for man...
This paper describes an experiment in applying standard supervised machine learning algorithms (C4.5...
In this paper, the problem of classifying a HTML documents into a hierarchy of categories is invest...
Abstract. This paper describes an intelligent information system for effectively managing huge amoun...
This paper describes the usage of machine learning techniques to assign keywords to documents. The l...
As the dramatic expansion of online publications continues, state libraries urgently need effective ...
Abstract. In this paper, the problem of classifying a HTML documents into a hierarchy of categories ...
While automated methods for information organization have been around for several decades now, expon...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
With the exponential growth of the World Wide Web, automated subject classification of Web pages has...
Purpose– To provide an integrated perspective to similarities and differences between approaches to ...
Purpose - To provide an integrated perspective to similarities and differences between approaches to...
The primary objective of this study was to identify and address problems of applying a controlled vo...
Most of the research on text categorization has focused on classifying text documents into a set of ...
A machine-learning and a string-matching approach to automated subject classification of text were c...
International audienceAutomated subject classification has been a challenging research issue for man...
This paper describes an experiment in applying standard supervised machine learning algorithms (C4.5...
In this paper, the problem of classifying a HTML documents into a hierarchy of categories is invest...
Abstract. This paper describes an intelligent information system for effectively managing huge amoun...
This paper describes the usage of machine learning techniques to assign keywords to documents. The l...
As the dramatic expansion of online publications continues, state libraries urgently need effective ...
Abstract. In this paper, the problem of classifying a HTML documents into a hierarchy of categories ...