This doctoral thesis focuses on three popular unsupervised learning problems: subspace clustering, robust PCA, and column sampling. For the subspace clustering problem, a new transformative idea is presented. The proposed approach, termed Innovation Pursuit, is a new geometrical solution to the subspace clustering problem whereby subspaces are identified based on their relative novelties. A detailed mathematical analysis is provided establishing sufficient conditions for the proposed method to correctly cluster the data points. The numerical simulations with both real and synthetic data demonstrate that Innovation Pursuit notably outperforms the state-of-the-art subspace clustering algorithms. For the robust PCA problem, we focus on both th...
Increasingly large multimedia databases in life sciences, e-commerce, or monitoring applications can...
This paper presents a remarkably simple, yet powerful, algorithm termed coherence pursuit (CoP) to r...
In subspace clustering, a group of data points belonging to a union of subspaces are assigned member...
This paper explores and analyzes two randomized designs for robust principal component analysis empl...
Recent years have witnessed an explosion of data across scientific fields enabled by advances in sen...
In this paper, a randomized PCA algorithm that is robust to the presence of outliers and whose compl...
The questions brought by high dimensional data is interesting and challenging. Our study is targetin...
The unprecedented growth of data in volume and dimension has led to an increased number of computati...
High dimensional data and the presence of outliers in data each pose a serious challenge in supervis...
A remarkably simple, yet powerful, algorithm termed Coherence Pursuit for robust Principal Component...
This paper presents a new scalable approach, termed Innovation Pursuit (iPursuit), to the problem of...
Subspace clustering is a powerful generalization of clustering for high-dimensional data analysis, w...
Subspace clustering has important and wide applica-tions in computer vision and pattern recognition....
In this paper, we introduce a bottom-up approach to discover clusters of outliers in any m-dimension...
This paper considers the problem of clustering a collection of unlabeled data points assumed to lie ...
Increasingly large multimedia databases in life sciences, e-commerce, or monitoring applications can...
This paper presents a remarkably simple, yet powerful, algorithm termed coherence pursuit (CoP) to r...
In subspace clustering, a group of data points belonging to a union of subspaces are assigned member...
This paper explores and analyzes two randomized designs for robust principal component analysis empl...
Recent years have witnessed an explosion of data across scientific fields enabled by advances in sen...
In this paper, a randomized PCA algorithm that is robust to the presence of outliers and whose compl...
The questions brought by high dimensional data is interesting and challenging. Our study is targetin...
The unprecedented growth of data in volume and dimension has led to an increased number of computati...
High dimensional data and the presence of outliers in data each pose a serious challenge in supervis...
A remarkably simple, yet powerful, algorithm termed Coherence Pursuit for robust Principal Component...
This paper presents a new scalable approach, termed Innovation Pursuit (iPursuit), to the problem of...
Subspace clustering is a powerful generalization of clustering for high-dimensional data analysis, w...
Subspace clustering has important and wide applica-tions in computer vision and pattern recognition....
In this paper, we introduce a bottom-up approach to discover clusters of outliers in any m-dimension...
This paper considers the problem of clustering a collection of unlabeled data points assumed to lie ...
Increasingly large multimedia databases in life sciences, e-commerce, or monitoring applications can...
This paper presents a remarkably simple, yet powerful, algorithm termed coherence pursuit (CoP) to r...
In subspace clustering, a group of data points belonging to a union of subspaces are assigned member...