Several clustering methods (e.g., Normalized Cut and Ratio Cut) divide the Min Cut cost function by a cluster dependent factor (e.g., the size or the degree of the clusters), in order to yield a more balanced partitioning. We, instead, investigate adding such regularizations to the original cost function. We first consider the case where the regularization term is the sum of the squared size of the clusters, and then generalize it to adaptive regularization of the pairwise similarities. This leads to shifting (adaptively) the pairwise similarities which might make some of them negative. We then study the connection of this method to Correlation Clustering and then propose an efficient local search optimization algorithm with fast theoretica...
A set of clustering algorithms with proper weight on the formulation of distance which extend to mix...
We develop a non-convex optimization approach to correlation clustering using the Frank-Wolfe (FW) f...
Many similarity-based clustering methods work in two separate steps including similarity matrix comp...
Hierarchical clustering based on pairwise similarities is a common tool used in a broad range of sci...
Many clustering methods partition the data groups based on the input data similarity matrix. Thus, t...
Data clustering is a difficult problem due to the complex and heterogeneous natures of multidimensio...
Pairwise clustering methods partition the data space into clusters by the pairwise similarity betwee...
We present a novel spectral clustering method that enables users to incor-porate prior knowledge of ...
We consider the problem of learning from a similarity matrix (such as spectral clustering and low-di...
The density-based clustering algorithm DBSCAN is a fundamental technique for data clustering with ma...
International audienceMany papers pointed out the interest of (co-)clustering both data and features...
International audienceThe goal of clustering is to group similar objects into meaningful partitions....
Abstract: Fast accumulation of large amounts of complex data has cre-ated a need for more sophistica...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
We present an iterative flat hard clustering algorithm designed to operate on arbitrary similarity m...
A set of clustering algorithms with proper weight on the formulation of distance which extend to mix...
We develop a non-convex optimization approach to correlation clustering using the Frank-Wolfe (FW) f...
Many similarity-based clustering methods work in two separate steps including similarity matrix comp...
Hierarchical clustering based on pairwise similarities is a common tool used in a broad range of sci...
Many clustering methods partition the data groups based on the input data similarity matrix. Thus, t...
Data clustering is a difficult problem due to the complex and heterogeneous natures of multidimensio...
Pairwise clustering methods partition the data space into clusters by the pairwise similarity betwee...
We present a novel spectral clustering method that enables users to incor-porate prior knowledge of ...
We consider the problem of learning from a similarity matrix (such as spectral clustering and low-di...
The density-based clustering algorithm DBSCAN is a fundamental technique for data clustering with ma...
International audienceMany papers pointed out the interest of (co-)clustering both data and features...
International audienceThe goal of clustering is to group similar objects into meaningful partitions....
Abstract: Fast accumulation of large amounts of complex data has cre-ated a need for more sophistica...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
We present an iterative flat hard clustering algorithm designed to operate on arbitrary similarity m...
A set of clustering algorithms with proper weight on the formulation of distance which extend to mix...
We develop a non-convex optimization approach to correlation clustering using the Frank-Wolfe (FW) f...
Many similarity-based clustering methods work in two separate steps including similarity matrix comp...