Purpose: To provide an integrated perspective on the similarities and differences between approaches to automated classification in different research communities (machine learning, information retrieval and library science), and to point out problems with the approaches and with automated classification as such. Design/methodology/approach: A range of works dealing with automated classification of full-text web documents is discussed. Individual approaches are explored in the following sections: special features (description, differences, evaluation), application, and characteristics of web pages. Findings: Provides major similarities and differences between the three approaches: document pre-processing and utilization of web-specific docu...
With the evolution of the Internet, the meaning and accessibility of text documents and electronic infor...
In this paper we discuss several issues related to automated text classification of web sites. We an...
This paper describes an experiment in applying standard supervised machine learning algorithms (C4.5...
With the exponential growth of the World Wide Web, automated subject classification of Web pages has...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
While automated methods for information organization have been around for several decades now, expon...
The primary objective of this study was to identify and address problems of applying a controlled vo...
A machine-learning and a string-matching approach to automated subject classification of text were c...
With the explosive growth in the number of electronic documents available on the internet, intranets...
We study the automatic classification of Web documents into pre-specified categories, with the ob...
This paper deals with automatic classification of text documents, showing advantages of the classifi...
Genre characterizes text differently than the usual subject or propositional content that has been t...
Most of the research on text categorization has focused on classifying text documents into a set of ...