The primary objective of this study was to identify and address problems of applying a controlled vocabulary in automated subject classification of textual Web pages, in the area of engineering. Web pages have special characteristics such as structural information, but are at the same time rather heterogeneous. The classification approach used comprises string-to-string matching between words in a term list extracted from the Ei (Engineering Information) thesaurus and classification scheme, and words in the text to be classified. Based on a sample of 70 Web pages, a number of problems with the term list are identified. Reasons for those problems are discussed and improvements proposed. Methods for implementing the improvements are also spec...
: We study the automatic classification of Web documents into pre-specified categories, with the ob...
Purpose - The purpose of this study is twofold: to investigate whether it is meaningful to use the E...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
The primary objective of this study was to identify and address problems of applying a controlled vo...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
Purpose - To provide an integrated perspective to similarities and differences between approaches to...
Purpose– To provide an integrated perspective to similarities and differences between approaches to ...
With the exponential growth of the World Wide Web, automated subject classification of Web pages has...
While automated methods for information organization have been around for several decades now, expon...
International audienceAutomated subject classification has been a challenging research issue for man...
The paper aims to explore to what degree different types of terms in engineering information (Ei) th...
The aim of the study was to determine how significance indicators assigned to different Web page ele...
A machine-learning and a string-matching approach to automated subject classification of text were c...
This paper presents problem of automatic webpages classification using association rules based class...
In this paper we discuss several issues related to automated text classification of web sites. We an...
: We study the automatic classification of Web documents into pre-specified categories, with the ob...
Purpose - The purpose of this study is twofold: to investigate whether it is meaningful to use the E...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
The primary objective of this study was to identify and address problems of applying a controlled vo...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
Purpose - To provide an integrated perspective to similarities and differences between approaches to...
Purpose– To provide an integrated perspective to similarities and differences between approaches to ...
With the exponential growth of the World Wide Web, automated subject classification of Web pages has...
While automated methods for information organization have been around for several decades now, expon...
International audienceAutomated subject classification has been a challenging research issue for man...
The paper aims to explore to what degree different types of terms in engineering information (Ei) th...
The aim of the study was to determine how significance indicators assigned to different Web page ele...
A machine-learning and a string-matching approach to automated subject classification of text were c...
This paper presents problem of automatic webpages classification using association rules based class...
In this paper we discuss several issues related to automated text classification of web sites. We an...
: We study the automatic classification of Web documents into pre-specified categories, with the ob...
Purpose - The purpose of this study is twofold: to investigate whether it is meaningful to use the E...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...