We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints between data points. We evaluate and compare existing techniques in terms of robustness to misspecified constraints. We show that the technique that strictly enforces the given constraints, namely the chunklet model, produces poor results even under a small number of misspecified constraints. We further show that methods that penalize constraint violation are more robust to misspecified constraints but have undesirable local behaviors. Based on this evaluation, we propose a new learning technique, extending the chunklet model to allow soft constraints represented by an intuitive measure of confidence in the constraint. 1
Semi-supervised clustering methods incorporate a limited amount of supervision into the clustering p...
Due to strong demand for the ability to enforce top-down struc-ture on clustering results, semi-supe...
Constraint satisfaction and optimization (CSP(O)), probabilistic inference, and data mining are thre...
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints betw...
Metric clustering is fundamental in areas ranging from Combinatorial Optimization and Data Mining, t...
A number of clustering algorithms have been proposed for use in tasks where a limited degree of supe...
A number of clustering algorithms have been proposed for use in tasks where a limited degree of supe...
One of the key tools to gain knowledge from data is clustering: identifying groups of instances that...
Constrained clustering addresses the problem of creating minimum variance clusters with the added co...
Unsupervised clustering can be significantly improved using supervision in the form of pairwise cons...
Abstract — While clustering is usually an unsupervised operation, there are circumstances where we h...
Abstract. A number of clustering algorithms have been proposed for use in tasks where a limited degr...
Constraint-based clustering leverages user-provided constraints to produce a clustering that matches...
Significant progress in clustering has been achieved by algorithms that are based on pairwise affini...
The Dirichlet process mixture (DPM) model, a typical Bayesian nonparametric model, can infer the num...
Semi-supervised clustering methods incorporate a limited amount of supervision into the clustering p...
Due to strong demand for the ability to enforce top-down struc-ture on clustering results, semi-supe...
Constraint satisfaction and optimization (CSP(O)), probabilistic inference, and data mining are thre...
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints betw...
Metric clustering is fundamental in areas ranging from Combinatorial Optimization and Data Mining, t...
A number of clustering algorithms have been proposed for use in tasks where a limited degree of supe...
A number of clustering algorithms have been proposed for use in tasks where a limited degree of supe...
One of the key tools to gain knowledge from data is clustering: identifying groups of instances that...
Constrained clustering addresses the problem of creating minimum variance clusters with the added co...
Unsupervised clustering can be significantly improved using supervision in the form of pairwise cons...
Abstract — While clustering is usually an unsupervised operation, there are circumstances where we h...
Abstract. A number of clustering algorithms have been proposed for use in tasks where a limited degr...
Constraint-based clustering leverages user-provided constraints to produce a clustering that matches...
Significant progress in clustering has been achieved by algorithms that are based on pairwise affini...
The Dirichlet process mixture (DPM) model, a typical Bayesian nonparametric model, can infer the num...
Semi-supervised clustering methods incorporate a limited amount of supervision into the clustering p...
Due to strong demand for the ability to enforce top-down struc-ture on clustering results, semi-supe...
Constraint satisfaction and optimization (CSP(O)), probabilistic inference, and data mining are thre...