Center-based clustering is a fundamental primitive for data analysis and is very challenging for large datasets. We developed coreset based space/round-efficient MapReduce algorithms to solve the k-center, k-median, and k-means variants in general metrics. Remarkably, the algorithms obliviously adapt to the doubling dimension of the metric space, and attain approximation ratios that can be made arbitrarily close to those achievable by the best known polynomial-time sequential approximations
Probably the most famous clustering formulation is k-means. This is the focus today. Note: k-means i...
In this talk, we give an overview of the current best approximation algorithms for fundamental clust...
<p>In the first part of this chapter we detail center based clustering methods, namely methods based...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
This paper provides new algorithms for distributed clustering for two popular center-based objec-tiv...
This paper provides new algorithms for distributed clustering for two popular center-based objec-tiv...
In this paper, we show that for several clustering problems one can extract a small set of points, s...
In this thesis we show that, for several clustering problems, we can extract a small set of points, ...
Center-based clustering is a pivotal primitive for unsupervised learning and data analysis. A popul...
Clustering problems have numerous applications and are becoming more challenging as the size of the ...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
In this paper we present an n O(k 1\Gamma1=d ) time algorithm for solving the k-center problem i...
In the first part of this chapter we detail center based clustering methods, namely methods based on...
In discrete k-center and k-median clustering, we are given a set of points P in a metric space M, an...
Probably the most famous clustering formulation is k-means. This is the focus today. Note: k-means i...
In this talk, we give an overview of the current best approximation algorithms for fundamental clust...
<p>In the first part of this chapter we detail center based clustering methods, namely methods based...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
This paper provides new algorithms for distributed clustering for two popular center-based objec-tiv...
This paper provides new algorithms for distributed clustering for two popular center-based objec-tiv...
In this paper, we show that for several clustering problems one can extract a small set of points, s...
In this thesis we show that, for several clustering problems, we can extract a small set of points, ...
Center-based clustering is a pivotal primitive for unsupervised learning and data analysis. A popul...
Clustering problems have numerous applications and are becoming more challenging as the size of the ...
Center-based clustering is a fundamental primitive for data analysis and becomes very challenging fo...
In this paper we present an n O(k 1\Gamma1=d ) time algorithm for solving the k-center problem i...
In the first part of this chapter we detail center based clustering methods, namely methods based on...
In discrete k-center and k-median clustering, we are given a set of points P in a metric space M, an...
Probably the most famous clustering formulation is k-means. This is the focus today. Note: k-means i...
In this talk, we give an overview of the current best approximation algorithms for fundamental clust...
<p>In the first part of this chapter we detail center based clustering methods, namely methods based...