Classical methods such as Principal Component Analysis (PCA) and Canonical Correlation Analysis (CCA) are ubiquitous in statistics. However, these techniques are only able to reveal linear re-lationships in data. Although nonlinear variants of PCA and CCA have been proposed, these are computationally prohibitive in the large scale. In a separate strand of recent research, randomized methods have been proposed to construct features that help reveal nonlinear patterns in data. For basic tasks such as regression or classification, random features exhibit little or no loss in performance, while achieving drastic savings in computational requirements. In this paper we leverage randomness to design scalable new variants of nonlinear PCA and CCA; ...
In this article, we propose a new framework for matrix factorization based on principal component an...
We address two issues that are fundamental to the analysis of naturally-occurring datasets: how to e...
Summarising a high dimensional data set with a low dimensional embedding is a standard approach for ...
Principal component analysis (PCA) is a widespread technique for data analysis that relies on the co...
This paper develops and analyzes a randomized design for robust Principal Component Analysis (PCA). ...
This book expounds the principle and related applications of nonlinear principal component analysis ...
In this paper, a randomized PCA algorithm that is robust to the presence of outliers and whose compl...
Robust Principal Component Analysis (PCA) (or robust subspace recovery) is a particularly important ...
Principal component analysis (PCA) is a ubiquitous statistical technique for data analysis. PCA is ...
Pearson’s correlation measure is only able to model linear dependence between random variables. Henc...
Probabilistic principal component analysis (PPCA) seeks a low dimensional representation of a data s...
Probabilistic principal component analysis (PPCA) seeks a low dimensional representation of a data s...
The simulation of multivariate data is often necessary for assessing the performance of multivariat...
Abstract The idea of summarizing the information contained in a large number of variables by a small...
High dimensional, low sample, data have singular sample covariance matrices,rendering them impossibl...
In this article, we propose a new framework for matrix factorization based on principal component an...
We address two issues that are fundamental to the analysis of naturally-occurring datasets: how to e...
Summarising a high dimensional data set with a low dimensional embedding is a standard approach for ...
Principal component analysis (PCA) is a widespread technique for data analysis that relies on the co...
This paper develops and analyzes a randomized design for robust Principal Component Analysis (PCA). ...
This book expounds the principle and related applications of nonlinear principal component analysis ...
In this paper, a randomized PCA algorithm that is robust to the presence of outliers and whose compl...
Robust Principal Component Analysis (PCA) (or robust subspace recovery) is a particularly important ...
Principal component analysis (PCA) is a ubiquitous statistical technique for data analysis. PCA is ...
Pearson’s correlation measure is only able to model linear dependence between random variables. Henc...
Probabilistic principal component analysis (PPCA) seeks a low dimensional representation of a data s...
Probabilistic principal component analysis (PPCA) seeks a low dimensional representation of a data s...
The simulation of multivariate data is often necessary for assessing the performance of multivariat...
Abstract The idea of summarizing the information contained in a large number of variables by a small...
High dimensional, low sample, data have singular sample covariance matrices,rendering them impossibl...
In this article, we propose a new framework for matrix factorization based on principal component an...
We address two issues that are fundamental to the analysis of naturally-occurring datasets: how to e...
Summarising a high dimensional data set with a low dimensional embedding is a standard approach for ...