With the exponential growth of the World Wide Web, automated subject classification has become a major research issue. Organizing web pages into a hierarchical structure for subject browsing has been gaining recognition as an important tool in information-seeking processes. The most frequent approach to automated classification is machine learning. However, it requires training documents and performs well on new documents only if they are sufficiently similar to the training documents. In the thesis, a string-matching algorithm based on a controlled vocabulary was explored. It does not require training documents, but instead reuses the intellectual work invested in creating the controlled vocabulary. Terms from the Engineering Information thesaurus an...
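The string-matching idea in this abstract can be illustrated with a minimal sketch. It is not the thesis's actual algorithm: the tiny vocabulary, the class labels, the term weights, and the `min_score` cut-off below are all hypothetical values chosen for illustration. The sketch only shows the general principle of matching controlled-vocabulary terms in a document and summing weights per class, with no training documents involved.

```python
import re
from collections import defaultdict

# Hypothetical toy vocabulary: term -> (class label, weight).
# A real controlled vocabulary would map thesaurus terms to classification codes.
VOCABULARY = {
    "heat transfer": ("Thermodynamics", 2.0),
    "entropy": ("Thermodynamics", 1.0),
    "reinforced concrete": ("Civil Engineering", 2.0),
    "bridge": ("Civil Engineering", 1.0),
}


def normalize(text):
    """Lowercase the text and reduce it to alphanumeric tokens joined by single spaces."""
    return " ".join(re.findall(r"[a-z0-9]+", text.lower()))


def classify(text, vocabulary=VOCABULARY, min_score=1.0):
    """Return (class, score) pairs, best first, for classes scoring at least min_score."""
    document = normalize(text)
    scores = defaultdict(float)
    for term, (label, weight) in vocabulary.items():
        # Whole-word (or whole-phrase) match of the normalized term.
        pattern = r"\b" + re.escape(normalize(term)) + r"\b"
        occurrences = len(re.findall(pattern, document))
        if occurrences:
            scores[label] += occurrences * weight
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [(label, score) for label, score in ranked if score >= min_score]


if __name__ == "__main__":
    sample = "The bridge deck uses reinforced concrete; a second bridge is planned."
    print(classify(sample))
```

Because the scores come directly from vocabulary matches, a document that contains none of the terms receives no class at all, which mirrors the trade-off the abstract describes: no training data is needed, but coverage depends entirely on the vocabulary.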
As the dramatic expansion of online publications continues, state libraries urgently need effective ...
The thesis concerns the use of classification schemes for organising resources in subject-based h...
Purpose - The purpose of this study is twofold: to investigate whether it is meaningful to use the E...
While automated methods for information organization have been around for several decades now, expon...
With the exponential growth of the World Wide Web, automated subject classification of Web pages has...
The primary objective of this study was to identify and address problems of applying a controlled vo...
Automated subject classification has been a challenging research issue for man...
Purpose - To provide an integrated perspective to similarities and differences between approaches to...
Purpose– To provide an integrated perspective to similarities and differences between approaches to ...
A machine-learning and a string-matching approach to automated subject classification of text were c...
Most of the research on text categorization has focused on classifying text documents into a set of ...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
With the explosive growth in the number of electronic documents available on the internet, intranets...
We study the automatic classification of Web documents into pre-specified categories, with the ob...
This paper describes an experiment in applying standard supervised machine learning algorithms (C4.5...