In this work, we jointly apply several text mining methods to a corpus of legal documents in order to compare the separation quality of two inherently different document classification schemes. The classification schemes are compared with the clusters produced by the k-means algorithm. In the future, we believe that our comparison method will be coupled with semi-supervised and active learning techniques. Also, this paper presents the idea of combining k-means and Principal Component Analysis for cluster visualization.The described idea allows calculations to be performed in reasonable amount of CPU time.status: publishe
AbstractText clustering is an important application of data mining. It is concerned with grouping si...
Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Master of ScienceDepartment of Computer ScienceWilliam HsuThis work describes a comparative study of...
The text is nothing but the combination of characters. Therefore, analyzing and extracting informati...
The fundamental goal of this research is to learn whether unsupervised learning can be used to clust...
AbstractText clustering is an important application of data mining. It is concerned with grouping si...
Clustering of text data is one of tasks of text mining. It divides documents into the different cate...
This study presents the results of an experimental study of two document clustering techniques which...
Increased advancement in a variety of study subjects and information technologies, has increased the...
Document clustering is a process of grouping documents into several natural and homogeneous clusters...
Data mining, also known as knowledge discovery in database (KDD), is the process to discover interes...
Data mining (DM) brings theories from several field including databases and optimization and data vi...
The fundamentals of human communication are language and written texts. Social media is an essential...
Few studies on text clustering for the Malay language have been conducted due to some limitations th...
Abstract: Clustering is a technique of collecting data into subsets in such a manner that identical ...
AbstractText clustering is an important application of data mining. It is concerned with grouping si...
Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Master of ScienceDepartment of Computer ScienceWilliam HsuThis work describes a comparative study of...
The text is nothing but the combination of characters. Therefore, analyzing and extracting informati...
The fundamental goal of this research is to learn whether unsupervised learning can be used to clust...
AbstractText clustering is an important application of data mining. It is concerned with grouping si...
Clustering of text data is one of tasks of text mining. It divides documents into the different cate...
This study presents the results of an experimental study of two document clustering techniques which...
Increased advancement in a variety of study subjects and information technologies, has increased the...
Document clustering is a process of grouping documents into several natural and homogeneous clusters...
Data mining, also known as knowledge discovery in database (KDD), is the process to discover interes...
Data mining (DM) brings theories from several field including databases and optimization and data vi...
The fundamentals of human communication are language and written texts. Social media is an essential...
Few studies on text clustering for the Malay language have been conducted due to some limitations th...
Abstract: Clustering is a technique of collecting data into subsets in such a manner that identical ...
AbstractText clustering is an important application of data mining. It is concerned with grouping si...
Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Master of ScienceDepartment of Computer ScienceWilliam HsuThis work describes a comparative study of...