This paper studies the problem of clustering in metric spaces while preserving the privacy of individual data. Specifically, we examine differentially private variants of the k-medians and Euclidean k-means problems. We present polynomial algorithms with constant multiplicative error and lower additive error than the previous state-of-the-art for each problem. Additionally, our algorithms use a clustering algorithm without differential privacy as a black-box. This allows practitioners to control the trade-off between runtime and approximation factor by choosing a suitable clustering algorithm to use
This paper proposes an effficient solution with high accuracy to the problem of privacy-preserving c...
Abstract k-means++ seeding has become a de facto standard for hard clustering algorithms. In this pa...
Mobile sensor networks are a great source of data. By collecting data with mobile sensor nodes from ...
We introduce a new (ϵₚ, δₚ)-differentially private algorithm for the k-means clustering problem. Giv...
Subspace clustering is an unsupervised learning problem that aims at grouping data points into multi...
This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy ...
International audienceIn this paper, we present the first differentially private clustering method f...
Find k low-dimensional linear subspaces to ap-proximate a set of unlabeled data points. • k-means ob...
Privacy-preserving data analysis is an emerging area that addresses the dilemma of performing data a...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
When designing clustering algorithms, the choice of initial centers is crucial for the quality of th...
K-means is one of the most important clustering algorithms, but it does introduce a risk of privacy ...
The freedom and transparency of information flow on the Internet has heightened concerns of privacy....
Collecting user data is crucial for advancing machine learning, social science, and government polic...
This work addresses the problem of learning from large collections of data with privacy guarantees. ...
This paper proposes an effficient solution with high accuracy to the problem of privacy-preserving c...
Abstract k-means++ seeding has become a de facto standard for hard clustering algorithms. In this pa...
Mobile sensor networks are a great source of data. By collecting data with mobile sensor nodes from ...
We introduce a new (ϵₚ, δₚ)-differentially private algorithm for the k-means clustering problem. Giv...
Subspace clustering is an unsupervised learning problem that aims at grouping data points into multi...
This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy ...
International audienceIn this paper, we present the first differentially private clustering method f...
Find k low-dimensional linear subspaces to ap-proximate a set of unlabeled data points. • k-means ob...
Privacy-preserving data analysis is an emerging area that addresses the dilemma of performing data a...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
When designing clustering algorithms, the choice of initial centers is crucial for the quality of th...
K-means is one of the most important clustering algorithms, but it does introduce a risk of privacy ...
The freedom and transparency of information flow on the Internet has heightened concerns of privacy....
Collecting user data is crucial for advancing machine learning, social science, and government polic...
This work addresses the problem of learning from large collections of data with privacy guarantees. ...
This paper proposes an effficient solution with high accuracy to the problem of privacy-preserving c...
Abstract k-means++ seeding has become a de facto standard for hard clustering algorithms. In this pa...
Mobile sensor networks are a great source of data. By collecting data with mobile sensor nodes from ...