We present a novel unsupervised learning scheme that simultaneously clusters variables of several types (e.g., documents, words and authors) based on pairwise interactions be-tween the types, as observed in co-occurrence data. In this scheme, multiple clustering systems are generated aiming at maximizing an objective function that measures multiple pairwise mutual information between cluster variables. To implement this idea, we pro-pose an algorithm that interleaves top-down clustering of some variables and bottom-up clustering of the other variables, with a local optimization correction routine. Focusing on document clustering we present an extensive empirical study of two-way, three-way and four-way applications of our scheme using six r...
Most existing semi-supervised document clustering approaches are model-based clustering and can be t...
Multi-view clustering has received much attention recently. Most of the existing multi-view clusteri...
Clustering is an unsupervised machine learning technique, which involves discovering different clust...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
Clustering has been widely used for knowledge discovery. In this paper, we propose an effective appr...
Most document clustering algorithms operate in a high dimensional bag-of-words space. The inherent p...
ia that provide significant distinctions between clustering methods and can help selecting appropria...
International audienceMany of the datasets encountered in statistics are two-dimensional in nature a...
In 2009, Yu et al. proposed a multimodal probability model (MPM) for clustering. This paper makes ad...
Clustering ensembles have emerged as a powerful method for improving both the robustness as well as ...
Abstract: Clustering is a technique of collecting data into subsets in such a manner that identical ...
The ever-increasing availability of textual documents has lead to a growing challenge for informatio...
In this paper, we present a document clustering framework incorporating instance-level knowledge in ...
Methods for high-dimensional data clustering represents a prolific research area in data mining, enc...
Most existing semi-supervised document clustering approaches are model-based clustering and can be t...
Multi-view clustering has received much attention recently. Most of the existing multi-view clusteri...
Clustering is an unsupervised machine learning technique, which involves discovering different clust...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
Clustering has been widely used for knowledge discovery. In this paper, we propose an effective appr...
Most document clustering algorithms operate in a high dimensional bag-of-words space. The inherent p...
ia that provide significant distinctions between clustering methods and can help selecting appropria...
International audienceMany of the datasets encountered in statistics are two-dimensional in nature a...
In 2009, Yu et al. proposed a multimodal probability model (MPM) for clustering. This paper makes ad...
Clustering ensembles have emerged as a powerful method for improving both the robustness as well as ...
Abstract: Clustering is a technique of collecting data into subsets in such a manner that identical ...
The ever-increasing availability of textual documents has lead to a growing challenge for informatio...
In this paper, we present a document clustering framework incorporating instance-level knowledge in ...
Methods for high-dimensional data clustering represents a prolific research area in data mining, enc...
Most existing semi-supervised document clustering approaches are model-based clustering and can be t...
Multi-view clustering has received much attention recently. Most of the existing multi-view clusteri...
Clustering is an unsupervised machine learning technique, which involves discovering different clust...