In the k-center problem for streaming points in d-dimensional metric space, input points are given in a data stream and the goal is to find the k smallest congruent balls whose union covers all input points by examining them. In the single-pass streaming model, input points are allowed to be examined only once and the amount of space that can be used to store relative information is limited. In this paper, we present a single-pass, (1.8 + epsilon)-factor, 0 (d/epsilon)-space data stream algorithm for the Euclidean 2-center problem for any d >= 1. This is the first result with an approximation factor below 2 using O (d/epsilon) space for any d. Our algorithm naturally extends to the Euclidean k-center problem with k > 2. We present a single-...
We explore clustering problems in the streaming sliding window model in both general metric spaces a...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
We study the 1-center problem with outliers in high-dimensional data streams. The problem definition...
In this paper we present a novel streaming algorithm for the k-center clustering problem for general...
We study the 2-center problem with outliers in high-dimensional data streams. Given a stream of poin...
Many applications such as financial transactions data, customer click stream continuously generates\...
We study clustering under the data stream model of computation where: given a sequence of points, th...
This thesis studies clustering problems on data streams, specifically with applications to metric sp...
In the matroid center problem, which generalizes the k-center problem, we need to pick a set of cent...
In this paper we present an n O(k 1\Gamma1=d ) time algorithm for solving the k-center problem i...
The k-means problem seeks a clustering that minimizes the sum of squared errors cost function: For i...
We present a novel algorithm for clustering streams of multidimensional points based on kernel densi...
In this paper we investigate algorithms and lower bounds for summarization problems over a single ...
Metric k-center clustering is a fundamental unsupervised learning primitive. Although widely used, t...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
We explore clustering problems in the streaming sliding window model in both general metric spaces a...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
We study the 1-center problem with outliers in high-dimensional data streams. The problem definition...
In this paper we present a novel streaming algorithm for the k-center clustering problem for general...
We study the 2-center problem with outliers in high-dimensional data streams. Given a stream of poin...
Many applications such as financial transactions data, customer click stream continuously generates\...
We study clustering under the data stream model of computation where: given a sequence of points, th...
This thesis studies clustering problems on data streams, specifically with applications to metric sp...
In the matroid center problem, which generalizes the k-center problem, we need to pick a set of cent...
In this paper we present an n O(k 1\Gamma1=d ) time algorithm for solving the k-center problem i...
The k-means problem seeks a clustering that minimizes the sum of squared errors cost function: For i...
We present a novel algorithm for clustering streams of multidimensional points based on kernel densi...
In this paper we investigate algorithms and lower bounds for summarization problems over a single ...
Metric k-center clustering is a fundamental unsupervised learning primitive. Although widely used, t...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
We explore clustering problems in the streaming sliding window model in both general metric spaces a...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
We study the 1-center problem with outliers in high-dimensional data streams. The problem definition...