Purpose - To provide an integrated perspective on the similarities and differences between approaches to automated classification in different research communities (machine learning, information retrieval and library science), and to point to problems with the approaches and with automated classification as such. Design/methodology/approach - A range of works dealing with automated classification of full-text web documents is discussed. Individual approaches are explored in the following sections: special features (description, differences, evaluation), application and characteristics of web pages. Findings - Provides major similarities and differences between the three approaches: document pre-processing and utilization of web-specific d...
This bachelor's thesis deals with automatic document topic classification and provides a brief intro...
This paper describes an experiment in applying standard supervised machine learning algorithms (C4.5...
People frequently surf the internet using smartphones, laptops, or computers in order to ...
With the exponential growth of the World Wide Web, automated subject classification of Web pages has...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
While automated methods for information organization have been around for several decades now, expon...
The primary objective of this study was to identify and address problems of applying a controlled vo...
This paper has been peer-reviewed but does not include the final publisher proof-corrections or jour...
This paper deals with automatic classification of text documents, showing advantages of the classifi...
We study the automatic classification of Web documents into pre-specified categories, with the ob...
Genre characterizes text differently than the usual subject or propositional content that has been t...
With the explosive growth in the number of electronic documents available on the internet, intranets...
With the evolution of the Internet, the meaning and accessibility of text documents and electronic infor...
Automatic document classification techniques have been widely advocated for the study of various fi...