Cluster analyses are often conducted with the goal to characterize an underlying probability density, for which the data-point density serves as an estimate for this probability density. We here test and benchmark the common nearest neighbor (CNN) cluster algorithm. This algorithm assigns a spherical neighborhood R to each data point and estimates the data-point density between two data points as the number of data points N in the overlapping region of their neighborhoods (step 1). The main principle in the CNN cluster algorithm is cluster growing. This grows the clusters by sequentially adding data points and thereby effectively positions the border of the clusters along an iso-surface of the underlying probability density. This yields a s...
Cluster validation constitutes one of the most challenging problems in unsupervised cluster analysis...
A novel neural network clustering algorithm, CoRe, is benchmarked against previously published resul...
Data clustering is a fundamental machine learning problem. Community structure is common in social a...
Density-based clustering procedures are widely used in a variety of data science applications. Their...
The core-set approach is a discretization method for Markov state models of complex molecular dynami...
1 Introduction Clustering is the process of allocating points in a given dataset into disjoint and m...
The identification of metastable states of a molecule plays an important role in the interpretation o...
International audienceAbstract Motivation Density Peaks is a widely spread clustering algorithm that...
Many clustering methods, such as K-means, kernel K-means, and MNcut clustering, follow the same reci...
Clustering assigns data points into groups called clusters, which define the characteristics of simi...
A major problem in cluster analysis is determining the number of subpopulations from the sample data...
Clustering is a powerful exploratory technique for extracting the knowledge of given data. Several c...
We present an unsupervised data processing workflow that is specifically designed to obtain a fast c...
We present a procedure for the identification of clusters in multivariate data sets, based on the co...
A novel clustering algorithm CSHARP is presented for the purpose of finding clusters of arbitrary sh...
Cluster validation constitutes one of the most challenging problems in unsupervised cluster analysis...
A novel neural network clustering algorithm, CoRe, is benchmarked against previously published resul...
Data clustering is a fundamental machine learning problem. Community structure is common in social a...
Density-based clustering procedures are widely used in a variety of data science applications. Their...
The core-set approach is a discretization method for Markov state models of complex molecular dynami...
1 Introduction Clustering is the process of allocating points in a given dataset into disjoint and m...
The identification of metastable states of a molecule plays an important role in the interpretation o...
International audienceAbstract Motivation Density Peaks is a widely spread clustering algorithm that...
Many clustering methods, such as K-means, kernel K-means, and MNcut clustering, follow the same reci...
Clustering assigns data points into groups called clusters, which define the characteristics of simi...
A major problem in cluster analysis is determining the number of subpopulations from the sample data...
Clustering is a powerful exploratory technique for extracting the knowledge of given data. Several c...
We present an unsupervised data processing workflow that is specifically designed to obtain a fast c...
We present a procedure for the identification of clusters in multivariate data sets, based on the co...
A novel clustering algorithm CSHARP is presented for the purpose of finding clusters of arbitrary sh...
Cluster validation constitutes one of the most challenging problems in unsupervised cluster analysis...
A novel neural network clustering algorithm, CoRe, is benchmarked against previously published resul...
Data clustering is a fundamental machine learning problem. Community structure is common in social a...