Clustering problems and clustering algorithms are often overly sensitive to the presence of outliers: even a handful of points can greatly affect the structure of the optimal solution and its cost. This is why many algorithms for robust clustering problems have been formulated in recent years. These algorithms discard some points as outliers, excluding them from the clustering. However, outlier selection can be unfair: some categories of input points may be disproportionately affected by the outlier removal algorithm. We study the problem of k-clustering with fair outlier removal and provide the first approximation algorithm for well-known clustering formulations, such as k-means and k-median. We analyze this algorithm and prove that it ...
Plain vanilla K-means clustering has proven to be successful in practice, yet it suffers from outlie...
This study enhances K-means Mahalanobis clustering using Density Power Divergence (DPD) for outlier ...
Recent developments in local search analysis have yielded the first polynomial-time approximation sc...
Clustering with outliers is one of the most fundamental problems in Computer Science. Given a set X...
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Man...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
Plain vanilla K-means clustering is prone to produce unbalanced clusters and suffers from outlier se...
We introduce a robust k-means-based clustering method for high-dimensional data where not only outli...
[[abstract]]Identifying outliers an remainder clusters which are used to designate few patterns that...
Statistical data frequently includes outliers; these can distort the results of estimation procedure...
Summarization: Over the last years, many variations of the quadratic k-means clustering procedure ha...
Statistical data frequently includes outliers; these can distort the results of estimation procedure...
Clustering is a fundamental problem in unsupervised learning. In many real-world applications, the t...
This study enhances K-means Mahalanobis clustering using Density Power Divergence (DPD) for outlier ...
Plain vanilla K-means clustering has proven to be successful in practice, yet it suffers from outlie...
This study enhances K-means Mahalanobis clustering using Density Power Divergence (DPD) for outlier ...
Recent developments in local search analysis have yielded the first polynomial-time approximation sc...
Clustering with outliers is one of the most fundamental problems in Computer Science. Given a set X...
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Man...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
Plain vanilla K-means clustering is prone to produce unbalanced clusters and suffers from outlier se...
We introduce a robust k-means-based clustering method for high-dimensional data where not only outli...
[[abstract]]Identifying outliers an remainder clusters which are used to designate few patterns that...
Statistical data frequently includes outliers; these can distort the results of estimation procedure...
Summarization: Over the last years, many variations of the quadratic k-means clustering procedure ha...
Statistical data frequently includes outliers; these can distort the results of estimation procedure...
Clustering is a fundamental problem in unsupervised learning. In many real-world applications, the t...
This study enhances K-means Mahalanobis clustering using Density Power Divergence (DPD) for outlier ...
Plain vanilla K-means clustering has proven to be successful in practice, yet it suffers from outlie...
This study enhances K-means Mahalanobis clustering using Density Power Divergence (DPD) for outlier ...
Recent developments in local search analysis have yielded the first polynomial-time approximation sc...