International audienceAutomated subject classification has been a challenging research issue for many years now, receiving particular attention in the past decade due to rapid increase of digital documents. The most frequent approach to automated classification is machine learning. It, however, requires training documents and performs well on new documents only if these are similar enough to the former. We explore a string-matching algorithm based on a controlled vocabulary, which does not require training documents – instead it reuses the intellectual work put into creating the controlled vocabulary. Terms from the Engineering Information thesaurus and classification scheme were matched against title and abstract of engineering papers from...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
In the last ten years, automatic Text Categorization (TC) has been gaining an increasing interest fr...
This paper describes the usage of machine learning techniques to assign keywords to documents. The l...
International audienceAutomated subject classification has been a challenging research issue for man...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
The primary objective of this study was to identify and address problems of applying a controlled vo...
While automated methods for information organization have been around for several decades now, expon...
The paper aims to explore to what degree different types of terms in engineering information (Ei) th...
A machine-learning and a string-matching approach to automated subject classification of text were c...
A significant part of professional communication development in engineering is the ability to learn ...
This paper describes an experiment in applying standard supervised machine learning algorithms (C4.5...
With the explosive growth in the number of electronic documents available on the internet, intranets...
The (unheralded) first step in many applications of automated text analysis involves selecting keywo...
The (unheralded) first step in many applications of automated text analysis involves selecting keywo...
As the dramatic expansion of online publications continues, state libraries urgently need effective ...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
In the last ten years, automatic Text Categorization (TC) has been gaining an increasing interest fr...
This paper describes the usage of machine learning techniques to assign keywords to documents. The l...
International audienceAutomated subject classification has been a challenging research issue for man...
With the exponential growth of the World Wide Web, automated subject classification has become a maj...
The primary objective of this study was to identify and address problems of applying a controlled vo...
While automated methods for information organization have been around for several decades now, expon...
The paper aims to explore to what degree different types of terms in engineering information (Ei) th...
A machine-learning and a string-matching approach to automated subject classification of text were c...
A significant part of professional communication development in engineering is the ability to learn ...
This paper describes an experiment in applying standard supervised machine learning algorithms (C4.5...
With the explosive growth in the number of electronic documents available on the internet, intranets...
The (unheralded) first step in many applications of automated text analysis involves selecting keywo...
The (unheralded) first step in many applications of automated text analysis involves selecting keywo...
As the dramatic expansion of online publications continues, state libraries urgently need effective ...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
In the last ten years, automatic Text Categorization (TC) has been gaining an increasing interest fr...
This paper describes the usage of machine learning techniques to assign keywords to documents. The l...