File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: STATISTICAL INFERENCE FOR PRINCIPAL COMPONENTS OF SPIKED COVARIANCE MATRICES

TitleSTATISTICAL INFERENCE FOR PRINCIPAL COMPONENTS OF SPIKED COVARIANCE MATRICES
Authors
Keywordsadaptive estimator
eigenvector
principal component
Random matrix
sample covariance matrix
spiked model
Issue Date2022
Citation
Annals of Statistics, 2022, v. 50, n. 2, p. 1144-1169 How to Cite?
AbstractIn this paper, we study the asymptotic behavior of the extreme eigenvalues and eigenvectors of the high-dimensional spiked sample covariance matrices, in the supercritical case when a reliable detection of spikes is possible. In particular, we derive the joint distribution of the extreme eigenvalues and the generalized components of the associated eigenvectors, that is, the projections of the eigenvectors onto arbitrary given direction, assuming that the dimension and sample size are comparably large. In general, the joint distribution is given in terms of linear combinations of finitely many Gaussian and Chi-square variables, with parameters depending on the projection direction and the spikes. Our assumption on the spikes is fully general. First, the strengths of spikes are only required to be slightly above the critical threshold and no upper bound on the strengths is needed. Second, multiple spikes, that is, spikes with the same strength, are allowed. Third, no structural assumption is imposed on the spikes. Thanks to the general setting, we can then apply the results to various high dimensional statistical hypothesis testing problems involving both the eigenvalues and eigenvectors. Specifically, we propose accurate and powerful statistics to conduct hypothesis testing on the principal components. These statistics are data-dependent and adaptive to the underlying true spikes. Numerical simulations also confirm the accuracy and powerfulness of our proposed statistics and illustrate significantly better performance compared to the existing methods in the literature. In particular, our methods are accurate and powerful even when either the spikes are small or the dimension is large.
Persistent Identifierhttp://hdl.handle.net/10722/349722
ISSN
2023 Impact Factor: 3.2
2023 SCImago Journal Rankings: 5.335

 

DC FieldValueLanguage
dc.contributor.authorBao, Zhigang-
dc.contributor.authorDing, Xiucai-
dc.contributor.authorWang, Jingming-
dc.contributor.authorWang, Ke-
dc.date.accessioned2024-10-17T07:00:23Z-
dc.date.available2024-10-17T07:00:23Z-
dc.date.issued2022-
dc.identifier.citationAnnals of Statistics, 2022, v. 50, n. 2, p. 1144-1169-
dc.identifier.issn0090-5364-
dc.identifier.urihttp://hdl.handle.net/10722/349722-
dc.description.abstractIn this paper, we study the asymptotic behavior of the extreme eigenvalues and eigenvectors of the high-dimensional spiked sample covariance matrices, in the supercritical case when a reliable detection of spikes is possible. In particular, we derive the joint distribution of the extreme eigenvalues and the generalized components of the associated eigenvectors, that is, the projections of the eigenvectors onto arbitrary given direction, assuming that the dimension and sample size are comparably large. In general, the joint distribution is given in terms of linear combinations of finitely many Gaussian and Chi-square variables, with parameters depending on the projection direction and the spikes. Our assumption on the spikes is fully general. First, the strengths of spikes are only required to be slightly above the critical threshold and no upper bound on the strengths is needed. Second, multiple spikes, that is, spikes with the same strength, are allowed. Third, no structural assumption is imposed on the spikes. Thanks to the general setting, we can then apply the results to various high dimensional statistical hypothesis testing problems involving both the eigenvalues and eigenvectors. Specifically, we propose accurate and powerful statistics to conduct hypothesis testing on the principal components. These statistics are data-dependent and adaptive to the underlying true spikes. Numerical simulations also confirm the accuracy and powerfulness of our proposed statistics and illustrate significantly better performance compared to the existing methods in the literature. In particular, our methods are accurate and powerful even when either the spikes are small or the dimension is large.-
dc.languageeng-
dc.relation.ispartofAnnals of Statistics-
dc.subjectadaptive estimator-
dc.subjecteigenvector-
dc.subjectprincipal component-
dc.subjectRandom matrix-
dc.subjectsample covariance matrix-
dc.subjectspiked model-
dc.titleSTATISTICAL INFERENCE FOR PRINCIPAL COMPONENTS OF SPIKED COVARIANCE MATRICES-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1214/21-AOS2143-
dc.identifier.scopuseid_2-s2.0-85130498186-
dc.identifier.volume50-
dc.identifier.issue2-
dc.identifier.spage1144-
dc.identifier.epage1169-
dc.identifier.eissn2168-8966-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats