We conduct the first systematic comparison of automated semantic annotation based on either the full-text or only on the title metadata of documents. Apart from the prominent text classification baselines kNN and SVM, we also compare recent techniques of Learning to Rank and neural networks and revisit the traditional methods logistic regression, Rocchio, and Naive Bayes. Across three of our four datasets, the performance of the classifications using only titles reaches over 90% of the quality compared to the performance when using the full-text
We propose an intelligent document title classification agent based on a theory of information infer...
While there are many studies on information retrieval models using full-text, there are presently no...
This paper presents a text annotation method based on semantic sequences to label a document and a c...
We conduct the first systematic comparison of automated semantic annotation based on either the full...
We conduct the first systematic comparison of automated semantic annotation based on either the full...
For (semi-)automated subject indexing systems in digital libraries, it is often more practical to us...
For (semi-)automated subject indexing systems in digital libraries, it is often more practical to us...
For (semi-)automated subject indexing systems in digital libraries, it is often more practical to us...
While there are many studies on information retrieval models using full-text, there are presently no...
While there are many studies on information retrieval models using full-text, there are presently no...
We introduce a framework for automated semantic document annotation that is composed of four process...
We propose a novel attention network for document annotation with user-generated tags. The network i...
To digest tremendous documents efficiently, people often resort to their titles, which normally prov...
In this paper, we introduce a multi-label lazy learning approach to deal with automatic semantic ind...
Search engines sometimes apply the search on the full text of documents or web-pages; but sometimes ...
We propose an intelligent document title classification agent based on a theory of information infer...
While there are many studies on information retrieval models using full-text, there are presently no...
This paper presents a text annotation method based on semantic sequences to label a document and a c...
We conduct the first systematic comparison of automated semantic annotation based on either the full...
We conduct the first systematic comparison of automated semantic annotation based on either the full...
For (semi-)automated subject indexing systems in digital libraries, it is often more practical to us...
For (semi-)automated subject indexing systems in digital libraries, it is often more practical to us...
For (semi-)automated subject indexing systems in digital libraries, it is often more practical to us...
While there are many studies on information retrieval models using full-text, there are presently no...
While there are many studies on information retrieval models using full-text, there are presently no...
We introduce a framework for automated semantic document annotation that is composed of four process...
We propose a novel attention network for document annotation with user-generated tags. The network i...
To digest tremendous documents efficiently, people often resort to their titles, which normally prov...
In this paper, we introduce a multi-label lazy learning approach to deal with automatic semantic ind...
Search engines sometimes apply the search on the full text of documents or web-pages; but sometimes ...
We propose an intelligent document title classification agent based on a theory of information infer...
While there are many studies on information retrieval models using full-text, there are presently no...
This paper presents a text annotation method based on semantic sequences to label a document and a c...