File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Canonical correlation coefficients of high-dimensional Gaussian vectors: Finite rank case

TitleCanonical correlation coefficients of high-dimensional Gaussian vectors: Finite rank case
Authors
KeywordsCanonical correlation analysis
Finite rank perturbation
High-dimensional data
Largest eigenvalues
MANOVA ensemble
Random matrices
Issue Date2019
Citation
Annals of Statistics, 2019, v. 47, n. 1, p. 612-640 How to Cite?
AbstractConsider a Gaussian vector z = (x, y), consisting of two sub-vectors x and y with dimensions p and q, respectively. With n independent observations of z, we study the correlation between x and y, from the perspective of the canonical correlation analysis. We investigate the high-dimensional case: both p and q are proportional to the sample size n. Denote by uv the population cross-covariance matrix of random vectors u and v, and denote by Suv the sample counterpart. The canonical correlation coefficients between x and y are known as the square roots of the nonzero eigenvalues of the canonical correlation matrix xx−1xyyy−1yx. In this paper, we focus on the case that xy is of finite rank k, that is, there are k nonzero canonical correlation coefficients, whose squares are denoted by r1 ≥ · · · ≥ rk > 0. We study the sample counterparts of ri, i = 1, . . ., k, that is, the largest k eigenvalues of the sample canonical correlation matrix Sxx−1SxySyy−1Syx, denoted by λ1 ≥ · · · ≥ λk. We show that there exists a threshold rc ∈ (0, 1), such that for each i ∈ {1, . . ., k}, when ri ≤ rc, λi converges almost surely to the right edge of the limiting spectral distribution of the sample canonical correlation matrix, denoted by d+. When ri > rc, λi possesses an almost sure limit in (d+, 1], from which we can recover ri’s in turn, thus provide an estimate of the latter in the high-dimensional scenario. We also obtain the limiting distribution of λi’s under appropriate normalization. Specifically, λi possesses Gaussian type fluctuation if ri > rc, and follows Tracy–Widom distribution if ri < rc. Some applications of our results are also discussed.
Persistent Identifierhttp://hdl.handle.net/10722/349295
ISSN
2023 Impact Factor: 3.2
2023 SCImago Journal Rankings: 5.335

 

DC FieldValueLanguage
dc.contributor.authorBao, Zhigang-
dc.contributor.authorHu, Jiang-
dc.contributor.authorPan, Guangming-
dc.contributor.authorZhou, Wang-
dc.date.accessioned2024-10-17T06:57:35Z-
dc.date.available2024-10-17T06:57:35Z-
dc.date.issued2019-
dc.identifier.citationAnnals of Statistics, 2019, v. 47, n. 1, p. 612-640-
dc.identifier.issn0090-5364-
dc.identifier.urihttp://hdl.handle.net/10722/349295-
dc.description.abstractConsider a Gaussian vector z = (x, y), consisting of two sub-vectors x and y with dimensions p and q, respectively. With n independent observations of z, we study the correlation between x and y, from the perspective of the canonical correlation analysis. We investigate the high-dimensional case: both p and q are proportional to the sample size n. Denote by uv the population cross-covariance matrix of random vectors u and v, and denote by Suv the sample counterpart. The canonical correlation coefficients between x and y are known as the square roots of the nonzero eigenvalues of the canonical correlation matrix xx−1xyyy−1yx. In this paper, we focus on the case that xy is of finite rank k, that is, there are k nonzero canonical correlation coefficients, whose squares are denoted by r1 ≥ · · · ≥ rk > 0. We study the sample counterparts of ri, i = 1, . . ., k, that is, the largest k eigenvalues of the sample canonical correlation matrix Sxx−1SxySyy−1Syx, denoted by λ1 ≥ · · · ≥ λk. We show that there exists a threshold rc ∈ (0, 1), such that for each i ∈ {1, . . ., k}, when ri ≤ rc, λi converges almost surely to the right edge of the limiting spectral distribution of the sample canonical correlation matrix, denoted by d+. When ri > rc, λi possesses an almost sure limit in (d+, 1], from which we can recover ri’s in turn, thus provide an estimate of the latter in the high-dimensional scenario. We also obtain the limiting distribution of λi’s under appropriate normalization. Specifically, λi possesses Gaussian type fluctuation if ri > rc, and follows Tracy–Widom distribution if ri < rc. Some applications of our results are also discussed.-
dc.languageeng-
dc.relation.ispartofAnnals of Statistics-
dc.subjectCanonical correlation analysis-
dc.subjectFinite rank perturbation-
dc.subjectHigh-dimensional data-
dc.subjectLargest eigenvalues-
dc.subjectMANOVA ensemble-
dc.subjectRandom matrices-
dc.titleCanonical correlation coefficients of high-dimensional Gaussian vectors: Finite rank case-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1214/18-AOS1704-
dc.identifier.scopuseid_2-s2.0-85057858403-
dc.identifier.volume47-
dc.identifier.issue1-
dc.identifier.spage612-
dc.identifier.epage640-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats