Plain vanilla K-means clustering is prone to produce unbalanced clusters and suffers from outlier sensitivity. To mitigate both shortcomings, we formulate a joint outlier-detection and clustering problem, which assigns a prescribed number of datapoints to an auxiliary outlier cluster and performs cardinality-constrained K-means clustering on the residual dataset. We cast this problem as a mixed-integer linear program (MILP) that admits tractable semidefinite and linear programming relaxations. We propose deterministic rounding schemes that transform the relaxed solutions to high quality solutions for the MILP. We prove that these solutions are optimal in the MILP if a cluster separation condition holds. To our best knowledge, we propose the...
Covering and clustering are two of the most important areas in the field of combinatorial optimizati...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...
Plain vanilla K-means clustering has proven to be successful in practice, yet it suffers from outlie...
Clustering with outliers is one of the most fundamental problems in Computer Science. Given a set X...
Clustering problems and clustering algorithms are often overly sensitive to the presence of outliers...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
Constrained clustering addresses the problem of creating minimum variance clusters with the added co...
[[abstract]]Identifying outliers an remainder clusters which are used to designate few patterns that...
Recent developments in local search analysis have yielded the first polynomial-time approximation sc...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
[[abstract]]In this paper, a two-phase clustering algorithm for outliers detection is proposed. We f...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...
Covering and clustering are two of the most important areas in the field of combinatorial optimizati...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...
Plain vanilla K-means clustering has proven to be successful in practice, yet it suffers from outlie...
Clustering with outliers is one of the most fundamental problems in Computer Science. Given a set X...
Clustering problems and clustering algorithms are often overly sensitive to the presence of outliers...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
This dissertation largely studies problems of two types. In the first part, we study ranking and clu...
Constrained clustering addresses the problem of creating minimum variance clusters with the added co...
[[abstract]]Identifying outliers an remainder clusters which are used to designate few patterns that...
Recent developments in local search analysis have yielded the first polynomial-time approximation sc...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
[[abstract]]In this paper, a two-phase clustering algorithm for outliers detection is proposed. We f...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...
Covering and clustering are two of the most important areas in the field of combinatorial optimizati...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...
We present a unified approach for simultaneous clustering and outlier detection in data. We utilize ...