In this paper, we consider the problem of clustering data points into low-dimensional subspaces in the presence of outliers. We pose the problem using a density estimation formulation with an associated generative model. Based on this probability model, we first develop an iterative expectation-maximization (EM) al-gorithm and then derive its global solution. In addition, we develop two Bayesian methods based on variational Bayesian (VB) approximation, which are capable of automatic dimensionality selection. While the first method is based on an al-ternating optimization scheme for all unknowns, the second method makes use of recent results in VB matrix factorization leading to fast and effective estimation. Both methods are extended to han...
In this paper, a randomized PCA algorithm that is robust to the presence of outliers and whose compl...
Subspace clustering separates data points ap-proximately lying on union of affine subspaces into sev...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
When a probabilistic model and its prior are given, Bayesian learning offers infer-ence with automat...
The problem of clustering noisy and incompletely observed high-dimensional data points into a union ...
This paper considers the problem of clustering a collection of unlabeled data points assumed to lie ...
In this paper, we address the subspace clustering problem. Given a set of data samples (vectors) app...
Abstract—The problem of detecting clusters in high-dimensional data is increasingly common in machin...
In this paper, we address the subspace clustering problem. Given a set of data samples (vectors) app...
Subspace clustering has important and wide applica-tions in computer vision and pattern recognition....
Nowadays we are in the big data era,where the data is usually high dimensional.How to process high d...
Recent advances of subspace clustering have provided a new way of constructing affinity matrices for...
The application of machine learning to inference problems in biology is dominated by supervised lear...
The questions brought by high dimensional data is interesting and challenging. Our study is targetin...
Abstract Subspace outlier detection has emerged as a practical approach for outlier detection. Class...
In this paper, a randomized PCA algorithm that is robust to the presence of outliers and whose compl...
Subspace clustering separates data points ap-proximately lying on union of affine subspaces into sev...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...
When a probabilistic model and its prior are given, Bayesian learning offers infer-ence with automat...
The problem of clustering noisy and incompletely observed high-dimensional data points into a union ...
This paper considers the problem of clustering a collection of unlabeled data points assumed to lie ...
In this paper, we address the subspace clustering problem. Given a set of data samples (vectors) app...
Abstract—The problem of detecting clusters in high-dimensional data is increasingly common in machin...
In this paper, we address the subspace clustering problem. Given a set of data samples (vectors) app...
Subspace clustering has important and wide applica-tions in computer vision and pattern recognition....
Nowadays we are in the big data era,where the data is usually high dimensional.How to process high d...
Recent advances of subspace clustering have provided a new way of constructing affinity matrices for...
The application of machine learning to inference problems in biology is dominated by supervised lear...
The questions brought by high dimensional data is interesting and challenging. Our study is targetin...
Abstract Subspace outlier detection has emerged as a practical approach for outlier detection. Class...
In this paper, a randomized PCA algorithm that is robust to the presence of outliers and whose compl...
Subspace clustering separates data points ap-proximately lying on union of affine subspaces into sev...
We study the problem of clustering a set of data points based on their similarity matrix, each entry...