Text clustering and classification are important machine learning tasks. In this work, a combination of their approaches is presented. The main purpose was to automatically prepare a set of clusters (or, more generally, concepts) that would subsequently serve as training data for learning a classifier. This work comprises the theoretical background, implementation details and experimental results of clustering and classification of text documents. A training set of documents is first hierarchically clustered by the bisecting k-means algorithm. The result is offered to an expert for modifications and possible improvements of the hierarchy. Following this, the resulting structure is used for learning a naive Bayes classifier and a test set of doc...
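The cluster-then-classify pipeline described in this abstract can be illustrated with a minimal sketch. It is not the author's implementation: the function names, the cosine similarity measure, the deterministic seeding, the Laplace smoothing and the toy documents are all assumptions made for illustration. The expert review step is skipped, and only the leaf clusters of the bisecting procedure are kept rather than the full hierarchy.

```python
import math
from collections import Counter

def vectorize(doc):
    """Bag-of-words term counts for a whitespace-tokenized document."""
    return Counter(doc.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b.get(t, 0) for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def centroid(vectors):
    total = Counter()
    for v in vectors:
        total.update(v)
    return Counter({t: c / len(vectors) for t, c in total.items()})

def two_means(vecs, idx, iters=10):
    """Split one cluster in two. Seeds (an assumption): the first member
    and the member least similar to it."""
    a = idx[0]
    b = min(idx, key=lambda i: cosine(vecs[a], vecs[i]))
    centers = [vecs[a], vecs[b]]
    left, right = list(idx), []
    for _ in range(iters):
        left = [i for i in idx
                if cosine(vecs[i], centers[0]) >= cosine(vecs[i], centers[1])]
        right = [i for i in idx if i not in left]
        if not left or not right:
            break
        centers = [centroid([vecs[i] for i in left]),
                   centroid([vecs[i] for i in right])]
    return left, right

def bisecting_kmeans(vecs, k):
    """Repeatedly bisect the largest cluster until k clusters exist."""
    clusters = [list(range(len(vecs)))]
    while len(clusters) < k:
        clusters.sort(key=len)
        largest = clusters.pop()
        left, right = two_means(vecs, largest)
        if not left or not right:  # cluster cannot be split further
            clusters.append(largest)
            break
        clusters += [left, right]
    return clusters

def train_nb(vecs, labels):
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""
    vocab = {t for v in vecs for t in v}
    classes = sorted(set(labels))
    prior = {c: math.log(labels.count(c) / len(labels)) for c in classes}
    counts = {c: Counter() for c in classes}
    for v, c in zip(vecs, labels):
        counts[c].update(v)
    loglik = {}
    for c in classes:
        total = sum(counts[c].values()) + len(vocab)
        loglik[c] = {t: math.log((counts[c][t] + 1) / total) for t in vocab}
    return prior, loglik

def classify(model, vec):
    """Return the most probable cluster label; unseen terms are ignored."""
    prior, loglik = model
    def score(c):
        return prior[c] + sum(n * loglik[c][t]
                              for t, n in vec.items() if t in loglik[c])
    return max(prior, key=score)

# Hypothetical toy corpus: three documents about pets, three about finance.
train_docs = [
    "cat dog pet animal",
    "dog pet puppy animal",
    "cat kitten pet animal",
    "stock market trade price",
    "market price share trade",
    "stock share invest price",
]
vecs = [vectorize(d) for d in train_docs]
clusters = bisecting_kmeans(vecs, k=2)

# Cluster ids become the training labels for the classifier.
labels = [0] * len(vecs)
for cid, members in enumerate(clusters):
    for i in members:
        labels[i] = cid

model = train_nb(vecs, labels)
pred_pet = classify(model, vectorize("puppy and kitten are pet animal"))
pred_fin = classify(model, vectorize("invest in stock market"))
```

In this sketch the clusters stand in for the expert-approved concepts: a new document is assigned to whichever cluster the naive Bayes model scores highest, so `pred_pet` and `pred_fin` are cluster ids rather than human-readable category names.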
Feature clustering is a powerful method to reduce the dimensionality of feature vectors for text cla...
This thesis deals with clustering methods in the text domain, which are tested for the purpose of use ...
Text classification is the task of automatically sorting a set of documents into categories from a p...
The objective of clustering is to partition an unstructured set of objects into clusters ...
Clustering of text data is one of the tasks of text mining. It divides documents into different cate...
This paper addresses the problem of learning to classify texts by exploiting information derived fro...
The thesis deals with text mining. It describes the theory of text document clustering as well as al...
Nowadays, the explosive growth in text data emphasizes the need for developing new and computational...
Text is nothing but a combination of characters. Therefore, analyzing and extracting informati...
Document clustering is text processing that groups documents with similar concepts. Clustering is def...
Supervised and unsupervised learning have been the focus of critical research in the areas of machin...