De-Biased Sparse PCA: Inference for Eigenstructure of Large Covariance Matrices
Metadata only
Date
2021-04Type
- Journal Article
Abstract
Sparse principal component analysis has become one of the most widely used techniques for dimensionality reduction in high-dimensional datasets. While many methods are available for point estimation of eigenstructure in high-dimensional settings, in this paper we propose methodology for uncertainty quantification, such as construction of confidence intervals and tests for the principal eigenvector and the corresponding largest eigenvalue. We base our methodology on an M-estimator with Lasso penalty which achieves minimax optimal rates and is used to construct a de-biased sparse PCA estimator. The novel estimator has a Gaussian limiting distribution and can be used for hypothesis testing or support recovery of the first eigenvector. The empirical performance of the new estimator is demonstrated on synthetic data and we also show that the estimator compares favourably with the classical PCA in moderately high-dimensional regimes. Show more
Publication status
publishedExternal links
Journal / series
IEEE Transactions on Information TheoryVolume
Pages / Article No.
Publisher
IEEESubject
Covariance matrix; eigenvectors; eigenvalues; PCA; High-dimensional model; sparsity; Lasso; asymptotic normality; confidence intervalsOrganisational unit
03717 - van de Geer, Sara / van de Geer, Sara
More
Show all metadata