We present a novel unsupervised learning scheme that simultaneously clusters variables of several types (e.g., documents, words and authors) based on pairwise interactions be- tween the types, as observed in co-occurrence data. In this scheme, multiple clustering systems are generated aiming at maximizing an objective function that measures multiple pairwise mutual information between cluster variables. To implement this idea, we pro- pose an algorithm that interleaves top-down clustering of some variables and bottom-up clustering of the other variables, with a local optimization correction routine. Focusing on document clustering we present an extensive empirical study of two-way, three-way and four-way applications of our scheme using six...
International audienceCo-clustering is more useful than one-sided clustering when dealing with high ...
International audienceThe simultaneous clustering of documents and words, known as co-clustering, ha...
Clustering is an unsupervised machine learning technique, which involves discovering different clust...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel implementation of the recently introduced information bottleneck method for unsup...
Most document clustering algorithms operate in a high dimensional bag-of-words space. The inherent p...
Traditional clustering is typically based on a single feature set. In some domains, several feature ...
International audienceMany of the datasets encountered in statistics are two-dimensional in nature a...
International audienceWe applied different clustering algorithms to the task of clus- tering multi-w...
Manual document categorization is time consuming, expensive, and difficult to manage for large colle...
Multi-view clustering has received much attention recently. Most of the existing multi-view clusteri...
Methods for high-dimensional data clustering represents a prolific research area in data mining, enc...
In this paper, we present a document clustering framework incorporating instance-level knowledge in ...
International audienceCo-clustering is more useful than one-sided clustering when dealing with high ...
International audienceThe simultaneous clustering of documents and words, known as co-clustering, ha...
Clustering is an unsupervised machine learning technique, which involves discovering different clust...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel unsupervised learning scheme that simultaneously clusters variables of several ty...
We present a novel implementation of the recently introduced information bottleneck method for unsup...
Most document clustering algorithms operate in a high dimensional bag-of-words space. The inherent p...
Traditional clustering is typically based on a single feature set. In some domains, several feature ...
International audienceMany of the datasets encountered in statistics are two-dimensional in nature a...
International audienceWe applied different clustering algorithms to the task of clus- tering multi-w...
Manual document categorization is time consuming, expensive, and difficult to manage for large colle...
Multi-view clustering has received much attention recently. Most of the existing multi-view clusteri...
Methods for high-dimensional data clustering represents a prolific research area in data mining, enc...
In this paper, we present a document clustering framework incorporating instance-level knowledge in ...
International audienceCo-clustering is more useful than one-sided clustering when dealing with high ...
International audienceThe simultaneous clustering of documents and words, known as co-clustering, ha...
Clustering is an unsupervised machine learning technique, which involves discovering different clust...