Classical model-based partitional clustering algorithms, such ask-means or mixture of Gaussians, provide only loose and indirect control over the size of the resulting clusters. In this work, we present a family of probabilistic clustering models that can be steered towards clusters of desired size by providing a prior distribution over the possible sizes, allowing the analyst to fine-tune exploratory analysis or to produce clusters of suitable size for future down-stream processing.Our formulation supports arbitrary multimodal prior distributions, generalizing the previous work on clustering algorithms searching for clusters of equal size or algorithms designed for the microclustering task of finding small clusters. We provide practical me...
Unsupervised clustering can be significantly improved using supervision in the form of pairwise cons...
The paper introduces a framework for clustering data objects in a similarity-based context. The aim ...
Clustering involves placing entities into mutually exclusive categories. We wish to relax the requir...
Classical model-based partitional clustering algorithms, such as k-means or mixture of Gaussians, pr...
Abstract Microclustering refers to clustering models that produce small clusters or, equivalently, t...
The probabilistic distance clustering method of [1] works well if the cluster sizes are approximatel...
We present a new iterative method for probabilistic clustering of data. Given clusters, their center...
In 2009, Yu et al. proposed a multimodal probability model (MPM) for clustering. This paper makes ad...
One of the common problems with clustering is that the generated clusters often do not match user ex...
Data clustering is a frequently used technique in finance, computer science, and engineering. In mos...
Determining the number of clusters in a dataset is a fundamental issue in data clustering. Many meth...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints betw...
Cluster analysis is a tool for data analysis. It is a method for finding clusters of a data set with...
We develop a Bayesian framework for tackling the supervised clustering problem, the generic problem ...
Unsupervised clustering can be significantly improved using supervision in the form of pairwise cons...
The paper introduces a framework for clustering data objects in a similarity-based context. The aim ...
Clustering involves placing entities into mutually exclusive categories. We wish to relax the requir...
Classical model-based partitional clustering algorithms, such as k-means or mixture of Gaussians, pr...
Abstract Microclustering refers to clustering models that produce small clusters or, equivalently, t...
The probabilistic distance clustering method of [1] works well if the cluster sizes are approximatel...
We present a new iterative method for probabilistic clustering of data. Given clusters, their center...
In 2009, Yu et al. proposed a multimodal probability model (MPM) for clustering. This paper makes ad...
One of the common problems with clustering is that the generated clusters often do not match user ex...
Data clustering is a frequently used technique in finance, computer science, and engineering. In mos...
Determining the number of clusters in a dataset is a fundamental issue in data clustering. Many meth...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints betw...
Cluster analysis is a tool for data analysis. It is a method for finding clusters of a data set with...
We develop a Bayesian framework for tackling the supervised clustering problem, the generic problem ...
Unsupervised clustering can be significantly improved using supervision in the form of pairwise cons...
The paper introduces a framework for clustering data objects in a similarity-based context. The aim ...
Clustering involves placing entities into mutually exclusive categories. We wish to relax the requir...