We describe work on automatically assigning labels to books using user-defined tags as the label set. Using supervised learning and exploring both binary and mul-ticlass classification, we train and test classifiers on sev-eral sets of features, focusing on the size of the sets, part-of-speech classes and named entities. Results indi-cate that a binary classifier, trained and tested on a fea-ture space that consists of a limited selection of parts of speech as well as all frequent named entities, achieves a classification precision of 81%, significantly outper-forming a baseline which assigns the top-10 most pop-ular tags to each book
Naïve Bayes, k-nearest neighbors, Adaboost, support vector machines and neural networks are five amo...
The aim of the thesis was to use deep learning methods for recognizing text- books and classifying t...
The aim of this thesis is to minimize manual work needed to create training data for text classifica...
Web includes digital libraries and billions of text documents. A fast and simple search through this...
The aim of this work is to study the feasibility of an automated classification of books in the soci...
Abstract. The aim of this work is to study the feasibility of an automated classification of books i...
In many important text classification problems, acquiring class labels for training documents is cos...
Objective: Develop an automated classifier for the classification of bibliographic material by means...
In this paper, we introduce the evolving label-set prob-lem encountered in building real-world text ...
With the explosive growth in the number of electronic documents available on the internet, intranets...
In this thesis, we compare the bag of words approach with doc2vec doc- ument embeddings on the task ...
In this paper, we introduce a method for categoriz-ing digital items according to their topic, only ...
Document classification is a key task for many text min-ing applications. However, traditional text ...
This paper studies the use of different sources of information for performing a text classifcation t...
Effective incorporation of human expertise, while exerting a low cognitive load, is a critical aspec...
Naïve Bayes, k-nearest neighbors, Adaboost, support vector machines and neural networks are five amo...
The aim of the thesis was to use deep learning methods for recognizing text- books and classifying t...
The aim of this thesis is to minimize manual work needed to create training data for text classifica...
Web includes digital libraries and billions of text documents. A fast and simple search through this...
The aim of this work is to study the feasibility of an automated classification of books in the soci...
Abstract. The aim of this work is to study the feasibility of an automated classification of books i...
In many important text classification problems, acquiring class labels for training documents is cos...
Objective: Develop an automated classifier for the classification of bibliographic material by means...
In this paper, we introduce the evolving label-set prob-lem encountered in building real-world text ...
With the explosive growth in the number of electronic documents available on the internet, intranets...
In this thesis, we compare the bag of words approach with doc2vec doc- ument embeddings on the task ...
In this paper, we introduce a method for categoriz-ing digital items according to their topic, only ...
Document classification is a key task for many text min-ing applications. However, traditional text ...
This paper studies the use of different sources of information for performing a text classifcation t...
Effective incorporation of human expertise, while exerting a low cognitive load, is a critical aspec...
Naïve Bayes, k-nearest neighbors, Adaboost, support vector machines and neural networks are five amo...
The aim of the thesis was to use deep learning methods for recognizing text- books and classifying t...
The aim of this thesis is to minimize manual work needed to create training data for text classifica...