File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1214/18-AOS1704
- Scopus: eid_2-s2.0-85057858403
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: Canonical correlation coefficients of high-dimensional Gaussian vectors: Finite rank case
Title | Canonical correlation coefficients of high-dimensional Gaussian vectors: Finite rank case |
---|---|
Authors | |
Keywords | Canonical correlation analysis Finite rank perturbation High-dimensional data Largest eigenvalues MANOVA ensemble Random matrices |
Issue Date | 2019 |
Citation | Annals of Statistics, 2019, v. 47, n. 1, p. 612-640 How to Cite? |
Abstract | Consider a Gaussian vector z = (x, y), consisting of two sub-vectors x and y with dimensions p and q, respectively. With n independent observations of z, we study the correlation between x and y, from the perspective of the canonical correlation analysis. We investigate the high-dimensional case: both p and q are proportional to the sample size n. Denote by uv the population cross-covariance matrix of random vectors u and v, and denote by Suv the sample counterpart. The canonical correlation coefficients between x and y are known as the square roots of the nonzero eigenvalues of the canonical correlation matrix xx−1xyyy−1yx. In this paper, we focus on the case that xy is of finite rank k, that is, there are k nonzero canonical correlation coefficients, whose squares are denoted by r1 ≥ · · · ≥ rk > 0. We study the sample counterparts of ri, i = 1, . . ., k, that is, the largest k eigenvalues of the sample canonical correlation matrix Sxx−1SxySyy−1Syx, denoted by λ1 ≥ · · · ≥ λk. We show that there exists a threshold rc ∈ (0, 1), such that for each i ∈ {1, . . ., k}, when ri ≤ rc, λi converges almost surely to the right edge of the limiting spectral distribution of the sample canonical correlation matrix, denoted by d+. When ri > rc, λi possesses an almost sure limit in (d+, 1], from which we can recover ri’s in turn, thus provide an estimate of the latter in the high-dimensional scenario. We also obtain the limiting distribution of λi’s under appropriate normalization. Specifically, λi possesses Gaussian type fluctuation if ri > rc, and follows Tracy–Widom distribution if ri < rc. Some applications of our results are also discussed. |
Persistent Identifier | http://hdl.handle.net/10722/349295 |
ISSN | 2023 Impact Factor: 3.2 2023 SCImago Journal Rankings: 5.335 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Bao, Zhigang | - |
dc.contributor.author | Hu, Jiang | - |
dc.contributor.author | Pan, Guangming | - |
dc.contributor.author | Zhou, Wang | - |
dc.date.accessioned | 2024-10-17T06:57:35Z | - |
dc.date.available | 2024-10-17T06:57:35Z | - |
dc.date.issued | 2019 | - |
dc.identifier.citation | Annals of Statistics, 2019, v. 47, n. 1, p. 612-640 | - |
dc.identifier.issn | 0090-5364 | - |
dc.identifier.uri | http://hdl.handle.net/10722/349295 | - |
dc.description.abstract | Consider a Gaussian vector z = (x, y), consisting of two sub-vectors x and y with dimensions p and q, respectively. With n independent observations of z, we study the correlation between x and y, from the perspective of the canonical correlation analysis. We investigate the high-dimensional case: both p and q are proportional to the sample size n. Denote by uv the population cross-covariance matrix of random vectors u and v, and denote by Suv the sample counterpart. The canonical correlation coefficients between x and y are known as the square roots of the nonzero eigenvalues of the canonical correlation matrix xx−1xyyy−1yx. In this paper, we focus on the case that xy is of finite rank k, that is, there are k nonzero canonical correlation coefficients, whose squares are denoted by r1 ≥ · · · ≥ rk > 0. We study the sample counterparts of ri, i = 1, . . ., k, that is, the largest k eigenvalues of the sample canonical correlation matrix Sxx−1SxySyy−1Syx, denoted by λ1 ≥ · · · ≥ λk. We show that there exists a threshold rc ∈ (0, 1), such that for each i ∈ {1, . . ., k}, when ri ≤ rc, λi converges almost surely to the right edge of the limiting spectral distribution of the sample canonical correlation matrix, denoted by d+. When ri > rc, λi possesses an almost sure limit in (d+, 1], from which we can recover ri’s in turn, thus provide an estimate of the latter in the high-dimensional scenario. We also obtain the limiting distribution of λi’s under appropriate normalization. Specifically, λi possesses Gaussian type fluctuation if ri > rc, and follows Tracy–Widom distribution if ri < rc. Some applications of our results are also discussed. | - |
dc.language | eng | - |
dc.relation.ispartof | Annals of Statistics | - |
dc.subject | Canonical correlation analysis | - |
dc.subject | Finite rank perturbation | - |
dc.subject | High-dimensional data | - |
dc.subject | Largest eigenvalues | - |
dc.subject | MANOVA ensemble | - |
dc.subject | Random matrices | - |
dc.title | Canonical correlation coefficients of high-dimensional Gaussian vectors: Finite rank case | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1214/18-AOS1704 | - |
dc.identifier.scopus | eid_2-s2.0-85057858403 | - |
dc.identifier.volume | 47 | - |
dc.identifier.issue | 1 | - |
dc.identifier.spage | 612 | - |
dc.identifier.epage | 640 | - |