We develop a non-convex optimization approach to correlation clustering using the Frank-Wolfe (FW) framework. We show that the basic approach leads to a simple and natural local search algorithm with guaranteed convergence. This algorithm already beats alternative algorithms by substantial margins in both running time and quality of the clustering. Using ideas from FW algorithms, we develop subsampling and variance reduction paradigms for this approach. This yields both a practical improvement of the algorithm and some interesting further directions to investigate. We demonstrate the performance on both synthetic and real world data sets
Abstract Clustering is a fundamental tool for analyzing large data sets. A rich body of work has be...
Abstract. We describe a new optimization scheme for finding high-quality clusterings in planar graph...
An algorithm is developed for solving clustering problems with the similarity measure defined using ...
We develop a non-convex optimization approach to correlation clustering using the Frank-Wolfe (FW) f...
Abstract. CorrelationClustering is now an established problem in the algorithms and constrained clus...
Clustering is one of an interesting data mining topics that can be applied in many fields. Recently...
AbstractWe consider the following general correlation-clustering problem [N. Bansal, A. Blum, S. Cha...
Abstract. Correlation clustering is the problem of finding a crisp par-tition of the vertices of a c...
Several clustering methods (e.g., Normalized Cut and Ratio Cut) divide the Min Cut cost function by ...
Data clustering is a difficult problem due to the complex and heterogeneous natures of multidimensio...
Correlation clustering is a widely studied framework for clustering based on pairwise similarity and...
The problem of cluster analysis is formulated as a problem of non-smooth, non-convex optimization, a...
A straightforward natural iterative heuristic for correlation clustering in the general setting is t...
Clustering is an important unsupervised learning task, with many applications in machine learning, c...
This paper introduces a polynomial time approxima-tion scheme for the metric Correlation Cluster-ing...
Abstract Clustering is a fundamental tool for analyzing large data sets. A rich body of work has be...
Abstract. We describe a new optimization scheme for finding high-quality clusterings in planar graph...
An algorithm is developed for solving clustering problems with the similarity measure defined using ...
We develop a non-convex optimization approach to correlation clustering using the Frank-Wolfe (FW) f...
Abstract. CorrelationClustering is now an established problem in the algorithms and constrained clus...
Clustering is one of an interesting data mining topics that can be applied in many fields. Recently...
AbstractWe consider the following general correlation-clustering problem [N. Bansal, A. Blum, S. Cha...
Abstract. Correlation clustering is the problem of finding a crisp par-tition of the vertices of a c...
Several clustering methods (e.g., Normalized Cut and Ratio Cut) divide the Min Cut cost function by ...
Data clustering is a difficult problem due to the complex and heterogeneous natures of multidimensio...
Correlation clustering is a widely studied framework for clustering based on pairwise similarity and...
The problem of cluster analysis is formulated as a problem of non-smooth, non-convex optimization, a...
A straightforward natural iterative heuristic for correlation clustering in the general setting is t...
Clustering is an important unsupervised learning task, with many applications in machine learning, c...
This paper introduces a polynomial time approxima-tion scheme for the metric Correlation Cluster-ing...
Abstract Clustering is a fundamental tool for analyzing large data sets. A rich body of work has be...
Abstract. We describe a new optimization scheme for finding high-quality clusterings in planar graph...
An algorithm is developed for solving clustering problems with the similarity measure defined using ...