Abstract Background In text mining, document clustering describes the efforts to assign unstructured documents to clusters, which in turn usually refer to topics. Clustering is widely used in science for data retrieval and organisation. Results In this paper we present and discuss a novel graph-theoretical approach for document clustering and its application on a real-world data set. We will show that the well-known graph partition to stable sets or cliques can be generalized to pseudostable sets or pseudocliques. This allows to perform a soft clustering as well as a hard clustering. The software is freely available on GitHub. Conclusions The presented integer linear programming as well as the greedy approach for this NP $\mathcal {NP}$-com...
This technical report addresses the problem of automatically structuring linked document collections...
This paper addresses the problem of automatically structuring linked document collections by using c...
Document clustering is a popular method for discovering useful information from text data. This pape...
Background: In text mining, document clustering describes the efforts to assign unstructured documen...
In text mining, document clustering describes the efforts to assign unstructured documents to cluste...
Recently published studies have shown that partitional clustering algorithms that optimize certain c...
Clustering is an essential data mining task with numerous applications. Clustering is the process of...
Abstract. In this paper we introduce and analyze two improvements to GDClust [1], a system for docum...
Automatic document clustering is one of the important operations performed on text documents. Most c...
A language-independent method for automatic clustering of certain classes of documents is described....
Abstract: Clustering is the problem of discovering “meaningful ” groups in given data. The first and...
International audienceFollowing the work of Inderjit S. Dhillon, this paper presents the document cl...
Nowadays, the explosive growth in text data emphasizes the need for developing new and computational...
Document clustering is text processing that groups documents with similar concept. Clustering is def...
This paper introduces a new technique of document clustering based on frequent senses. The proposed ...
This technical report addresses the problem of automatically structuring linked document collections...
This paper addresses the problem of automatically structuring linked document collections by using c...
Document clustering is a popular method for discovering useful information from text data. This pape...
Background: In text mining, document clustering describes the efforts to assign unstructured documen...
In text mining, document clustering describes the efforts to assign unstructured documents to cluste...
Recently published studies have shown that partitional clustering algorithms that optimize certain c...
Clustering is an essential data mining task with numerous applications. Clustering is the process of...
Abstract. In this paper we introduce and analyze two improvements to GDClust [1], a system for docum...
Automatic document clustering is one of the important operations performed on text documents. Most c...
A language-independent method for automatic clustering of certain classes of documents is described....
Abstract: Clustering is the problem of discovering “meaningful ” groups in given data. The first and...
International audienceFollowing the work of Inderjit S. Dhillon, this paper presents the document cl...
Nowadays, the explosive growth in text data emphasizes the need for developing new and computational...
Document clustering is text processing that groups documents with similar concept. Clustering is def...
This paper introduces a new technique of document clustering based on frequent senses. The proposed ...
This technical report addresses the problem of automatically structuring linked document collections...
This paper addresses the problem of automatically structuring linked document collections by using c...
Document clustering is a popular method for discovering useful information from text data. This pape...