In this paper, we address the problem of cor-relation clustering in the dynamic data stream model. The stream consists of updates to the edge weights of a graph on n nodes and the goal is to find a node-partition such that the end-points of negative-weight edges are typically in different clusters whereas the end-points of positive-weight edges are typically in the same cluster. We present polynomial-time, O(n ·polylog n)-space approxi-mation algorithms for natural problems that arise. We first develop data structures based on linear sketches that allow the “quality ” of a given node-partition to be measured. We then combine these data structures with convex programming and sampling techniques to solve the relevant approx-imation problem. H...
Abstract. Clustering is to identify densely populated subgroups in data, while correlation analysis ...
Many contemporary data sources in a variety of domains can naturally be represented as fully-dynamic...
In this paper we propose a graph stream clustering algorithm with a unied similarity measure on both...
Abstract Clustering is a fundamental tool for analyzing large data sets. A rich body of work has be...
AbstractWe consider the following general correlation-clustering problem [N. Bansal, A. Blum, S. Cha...
Abstract Common clustering algorithms require multiple scans of all the data to achieve convergence,...
We consider the following clustering problem: we have a complete graph on vertices (items), where e...
We consider the following clustering problem: we have a complete graph on n vertices (items), where ...
Correlation clustering, or multicut partitioning, is widely used in image segmentation for partition...
Correlation clustering, or multicut partitioning, is widely used in image segmentation for partition...
We study a natural generalization of the correlation cluster-ing problem to graphs in which the pair...
We study clustering under the data stream model of computation where: given a sequence of points, th...
"In this paper, we introduce a new clustering strategy for temporally ordered. data streams, which i...
In the Correlation Clustering problem, we are given a graph with its edges labeled as ``similar" and...
We present a novel algorithm for clustering streams of multidimensional points based on kernel densi...
Abstract. Clustering is to identify densely populated subgroups in data, while correlation analysis ...
Many contemporary data sources in a variety of domains can naturally be represented as fully-dynamic...
In this paper we propose a graph stream clustering algorithm with a unied similarity measure on both...
Abstract Clustering is a fundamental tool for analyzing large data sets. A rich body of work has be...
AbstractWe consider the following general correlation-clustering problem [N. Bansal, A. Blum, S. Cha...
Abstract Common clustering algorithms require multiple scans of all the data to achieve convergence,...
We consider the following clustering problem: we have a complete graph on vertices (items), where e...
We consider the following clustering problem: we have a complete graph on n vertices (items), where ...
Correlation clustering, or multicut partitioning, is widely used in image segmentation for partition...
Correlation clustering, or multicut partitioning, is widely used in image segmentation for partition...
We study a natural generalization of the correlation cluster-ing problem to graphs in which the pair...
We study clustering under the data stream model of computation where: given a sequence of points, th...
"In this paper, we introduce a new clustering strategy for temporally ordered. data streams, which i...
In the Correlation Clustering problem, we are given a graph with its edges labeled as ``similar" and...
We present a novel algorithm for clustering streams of multidimensional points based on kernel densi...
Abstract. Clustering is to identify densely populated subgroups in data, while correlation analysis ...
Many contemporary data sources in a variety of domains can naturally be represented as fully-dynamic...
In this paper we propose a graph stream clustering algorithm with a unied similarity measure on both...