Subspace perspective on canonical correlation analysis: Dimension reduction and minimax rates

被引:5
|
作者
Ma, Zhuang [1 ]
Li, Xiaodong [2 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat, 3730 Walnut St,Suite 400, Philadelphia, PA 19104 USA
[2] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
关键词
canonical correlation analysis; dimension reduction; minimax rates;
D O I
10.3150/19-BEJ1131
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Canonical correlation analysis (CCA) is a fundamental statistical tool for exploring the correlation structure between two sets of random variables. In this paper, motivated by the recent success of applying CCA to learn low dimensional representations of high dimensional objects, we propose two losses based on the principal angles between the model spaces spanned by the sample canonical variates and their population correspondents, respectively. We further characterize the non-asymptotic error bounds for the estimation risks under the proposed error metrics, which reveal how the performance of sample CCA depends adaptively on key quantities including the dimensions, the sample size, the condition number of the covariance matrices and particularly the population canonical correlation coefficients. The optimality of our uniform upper bounds is also justified by lower-bound analysis based on stringent and localized parameter spaces. To the best of our knowledge, for the first time our paper separates p(1) and p(2) for the first order term in the upper bounds without assuming the residual correlations are zeros. More significantly, our paper derives (1 - lambda(2)(k))( 1 - lambda(2)(k+1))/(lambda(k) - lambda(k+1))(2) for the first time in the non-asymptotic CCA estimation convergence k+1 rates, which is essential to understand the behavior of CCA when the leading canonical correlation coefficients are close to 1.
引用
收藏
页码:432 / 470
页数:39
相关论文
共 50 条
  • [1] Dimension reduction based on canonical correlation
    Fung, WK
    He, XM
    Liu, L
    Shi, PD
    STATISTICA SINICA, 2002, 12 (04) : 1093 - 1113
  • [2] Multivariate Association and Dimension Reduction: A Generalization of Canonical Correlation Analysis
    Iaci, Ross
    Sriram, T. N.
    Yin, Xiangrong
    BIOMETRICS, 2010, 66 (04) : 1107 - 1118
  • [3] MINIMAX ESTIMATION IN SPARSE CANONICAL CORRELATION ANALYSIS
    Gao, Chao
    Ma, Zongming
    Ren, Zhao
    Zhou, Harrison H.
    ANNALS OF STATISTICS, 2015, 43 (05): : 2168 - 2197
  • [4] Robust dimension reduction based on canonical correlation
    Zhou, Jianhui
    JOURNAL OF MULTIVARIATE ANALYSIS, 2009, 100 (01) : 195 - 209
  • [5] Tensor Canonical Correlation Analysis for Multi-View Dimension Reduction
    Luo, Yong
    Tao, Dacheng
    Ramamohanarao, Kotagiri
    Xu, Chao
    Wen, Yonggang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (11) : 3111 - 3124
  • [6] Tensor Canonical Correlation Analysis for Multi-view Dimension Reduction
    Luo, Yong
    Tao, Dacheng
    Ramamohanarao, Kotagiri
    Xu, Chao
    Wen, Yonggang
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1460 - 1461
  • [7] Dimension reduction in functional regression using mixed data canonical correlation analysis
    Wang, Guochang
    Lin, Nan
    Zhang, Baoxue
    STATISTICS AND ITS INTERFACE, 2013, 6 (02) : 187 - 196
  • [8] Generalized Canonical Correlation Analysis: A Subspace Intersection Approach
    Sorensen, Mikael
    Kanatsoulis, Charilaos, I
    Sidiropoulos, Nicholas D.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 2452 - 2467
  • [9] High-throughput data dimension reduction via seeded canonical correlation analysis
    Im, Yunju
    Gang, HeyIn
    Yoo, Jae Keun
    JOURNAL OF CHEMOMETRICS, 2015, 29 (03) : 193 - 199
  • [10] Dimension reduction based on constrained canonical correlation and variable filtering
    Zhou, Jianhui
    He, Xuming
    ANNALS OF STATISTICS, 2008, 36 (04): : 1649 - 1668