Abstract: We study clustering algorithms based on neighborhood graphs on a random sample of data points. The question we ask is how such a graph should be constructed in order to obtain optimal clustering results. Which type of neighborhood graph should one choose, mutual k-nearest-neighbor or symmetric k-nearest-neighbor? What is the optimal parameter k? In our setting, clusters are defined as connected components of the t-level set of the underlying probability distribution. Clusters are said to be identified in the neighborhood graph if connected components in the graph correspond to the true underlying clusters. Using techniques from random geometric graph theory, we prove bounds on the probability that clusters are identified successfull...
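To make the two graph types in the abstract concrete, the following is a minimal sketch, not the authors' procedure. It builds a mutual and a symmetric k-nearest-neighbor graph on a small synthetic sample and labels their connected components. The function names (`knn_indices`, `knn_graphs`, `connected_components`), the two-blob sample, and the choice k = 5 are all illustrative assumptions; the density-level (t-level set) step mentioned in the abstract is not modeled here.

```python
import numpy as np

def knn_indices(points, k):
    """Return an (n, k) array holding the indices of each point's k nearest neighbors."""
    diffs = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)  # a point is never its own neighbor
    return np.argsort(dist, axis=1)[:, :k]

def knn_graphs(points, k):
    """Return boolean adjacency matrices (mutual, symmetric) of the kNN graphs."""
    n = len(points)
    directed = np.zeros((n, n), dtype=bool)
    rows = np.repeat(np.arange(n), k)
    directed[rows, knn_indices(points, k).ravel()] = True  # i -> j if j is among i's kNN
    mutual = directed & directed.T     # edge only if each point is among the other's kNN
    symmetric = directed | directed.T  # edge if either point is among the other's kNN
    return mutual, symmetric

def connected_components(adj):
    """Label connected components of an undirected graph by depth-first search."""
    n = adj.shape[0]
    labels = np.full(n, -1)
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            node = stack.pop()
            for nb in np.flatnonzero(adj[node]):
                if labels[nb] == -1:
                    labels[nb] = current
                    stack.append(nb)
        current += 1
    return labels

# Hypothetical sample: two tight, well-separated blobs of 30 points each.
rng = np.random.default_rng(0)
sample = np.vstack([rng.normal(0.0, 0.1, (30, 2)), rng.normal(5.0, 0.1, (30, 2))])

mutual, symmetric = knn_graphs(sample, k=5)
labels_mut = connected_components(mutual)
labels_sym = connected_components(symmetric)
print("components (mutual):   ", len(set(labels_mut)))
print("components (symmetric):", len(set(labels_sym)))
```

Because every mutual edge is also a symmetric edge, the mutual graph can never have fewer connected components than the symmetric graph for the same k. This is one reason the choice between the two graph types interacts with the choice of k.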
Spectral clustering is a well-known graph-theoretic clustering algorithm. Although spectral clusteri...
Clustering is indispensable for data analysis in many scientific disciplines. Detecting clusters fro...
We consider the following clustering problem: we have a complete graph on vertices (items), where e...
Assume we are given a sample of points from some underlying distribution which contains several dist...
We present a procedure for the identification of clusters in multivariate data sets, based on the co...
Nearest neighbor ($k$-NN) graphs are widely used in machine learning and data mining applications, a...
Data clustering is a fundamental machine learning problem. Community structure is common in social a...
Abstract: Clustering is a well known data mining technique which is used to group together data item...
Graph clustering methods such as spectral clustering are defined for general weighted graphs. In mac...
The problem of graph clustering is a central optimization problem with various applications in numer...
The ''nearest-neighbor'' relation, or more generally the ''k-nearest-neighbors'' relation, defined f...
The ''nearest neighbor'' relation, or more generally the ''k nearest neighbors'' relation, defined f...