Metric k-center clustering is a fundamental unsupervised learning primitive. Although widely used, this primitive is heavily affected by noise in the data, so a more sensible variant seeks for the best solution that disregards a given number z of points of the dataset, which are called outliers. We provide efficient algorithms for this important variant in the streaming model under the sliding window setting, where, at each time step, the dataset to be clustered is the window W of the most recent data items. For general metric spaces, our algorithms achieve O1 approximation and, remarkably, require a working memory linear in k+z and only logarithmic in |W|. For spaces of bounded doubling dimension, the approximation can be made arbitrarily ...
Center-based clustering is a pivotal primitive for unsupervised learning and data analysis. A popul...
In this thesis we show that, for several clustering problems, we can extract a small set of points, ...
Clustering problems and clustering algorithms are often overly sensitive to the presence of outliers...
Metric k-center clustering is a fundamental unsupervised learning primitive. Although widely used, t...
In this paper we present a novel streaming algorithm for the k-center clustering problem for general...
We explore clustering problems in the streaming sliding window model in both general metric spaces a...
In this paper we present novel streaming algorithms for the k-center and the diameter estimation pro...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
In this paper we develop streaming algorithms for the diameter problem and the k-center clustering p...
In this paper we develop streaming algorithms for the diameter problem and the k-center clustering p...
We study the problem of k-center clustering with outliers in arbitrary metrics and Euclidean space. ...
Recent developments in local search analysis have yielded the first polynomial-time approximation sc...
In PODS 2003, Babcock, Datar, Motwani and O\u27Callaghan gave the first streaming solution for the k...
Clustering methods are one of the key steps that lead to the transformation of data to knowledge. Cl...
In the k-center problem for streaming points in d-dimensional metric space, input points are given i...
Center-based clustering is a pivotal primitive for unsupervised learning and data analysis. A popul...
In this thesis we show that, for several clustering problems, we can extract a small set of points, ...
Clustering problems and clustering algorithms are often overly sensitive to the presence of outliers...
Metric k-center clustering is a fundamental unsupervised learning primitive. Although widely used, t...
In this paper we present a novel streaming algorithm for the k-center clustering problem for general...
We explore clustering problems in the streaming sliding window model in both general metric spaces a...
In this paper we present novel streaming algorithms for the k-center and the diameter estimation pro...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
In this paper we develop streaming algorithms for the diameter problem and the k-center clustering p...
In this paper we develop streaming algorithms for the diameter problem and the k-center clustering p...
We study the problem of k-center clustering with outliers in arbitrary metrics and Euclidean space. ...
Recent developments in local search analysis have yielded the first polynomial-time approximation sc...
In PODS 2003, Babcock, Datar, Motwani and O\u27Callaghan gave the first streaming solution for the k...
Clustering methods are one of the key steps that lead to the transformation of data to knowledge. Cl...
In the k-center problem for streaming points in d-dimensional metric space, input points are given i...
Center-based clustering is a pivotal primitive for unsupervised learning and data analysis. A popul...
In this thesis we show that, for several clustering problems, we can extract a small set of points, ...
Clustering problems and clustering algorithms are often overly sensitive to the presence of outliers...