On estimation of the noise variance in high dimensional probabilistic principal component analysis

被引:28
|
作者
Passemier, Damien [1 ]
Li, Zhaoyuan [1 ]
Yao, Jianfeng [1 ]
机构
[1] Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
Goodness of fit; High dimensional data; Noise variance estimator; Number of principal components; Probabilistic principal component analysis; Random-matrix theory; CENTRAL LIMIT-THEOREMS; FACTOR MODELS; COVARIANCE-MATRIX; NUMBER; EIGENVALUES; TESTS;
D O I
10.1111/rssb.12153
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We develop new statistical theory for probabilistic principal component analysis models in high dimensions. The focus is the estimation of the noise variance, which is an important and unresolved issue when the number of variables is large in comparison with the sample size. We first unveil the reasons for an observed downward bias of the maximum likelihood estimator of the noise variance when the data dimension is high. We then propose a bias-corrected estimator by using random-matrix theory and establish its asymptotic normality. The superiority of the new and bias-corrected estimator over existing alternatives is checked by Monte Carlo experiments with various combinations of (p,n) (the dimension and sample size). Next, we construct a new criterion based on the bias-corrected estimator to determine the number of the principal components, and a consistent estimator is obtained. Its good performance is confirmed by a simulation study and real data analysis. The bias-corrected estimator is also used to derive new asymptotics for the related goodness-of-fit statistic under the high dimensional scheme.
引用
收藏
页码:51 / 67
页数:17
相关论文
共 50 条
  • [1] Fast Noise Variance Estimation by Principal Component Analysis
    Pyatykh, Stanislav
    Zheng, Lei
    Hesser, Juergen
    [J]. IMAGE PROCESSING: ALGORITHMS AND SYSTEMS XI, 2013, 8655
  • [2] Noise Variance Estimation Method Based on Regression Analysis and Principal Component Analysis
    Wu Jiang
    You Fei
    Jiang Ping
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (05) : 1195 - 1201
  • [3] A Cure for Variance Inflation in High Dimensional Kernel Principal Component Analysis
    Abrahamsen, Trine Julie
    Hansen, Lars Kai
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 2027 - 2044
  • [4] Image Noise Estimation Based on Principal Component Analysis and Variance-Stabilizing Transformation
    Ding, Ling
    Zhang, Huying
    Li, Bijun
    Xiao, Jinsheng
    Zhou, Jian
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT III, 2017, 10668 : 58 - 69
  • [5] Probabilistic two-dimensional principal component analysis
    College of Information Science and Technology, East China University of Science and Technology, Shanghai 200237, China
    [J]. Zidonghua Xuebao, 2008, 3 (353-359):
  • [6] Image Noise Level Estimation by Principal Component Analysis
    Pyatykh, Stanislav
    Hesser, Juergen
    Zheng, Lei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (02) : 687 - 699
  • [7] Continuous estimation of distribution algorithms with probabilistic principal component analysis
    Cho, DY
    Zhang, BT
    [J]. PROCEEDINGS OF THE 2001 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2001, : 521 - 526
  • [8] On the Consistency of Maximum Likelihood Estimation of Probabilistic Principal Component Analysis
    Datta, Arghya
    Chakrabarty, Sayak
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] HIGH DIMENSIONAL MATRIX ESTIMATION WITH UNKNOWN VARIANCE OF THE NOISE
    Gaiffas, Stephane
    Klopp, Olga
    [J]. STATISTICA SINICA, 2017, 27 (01) : 115 - 145
  • [10] Probabilistic principal component analysis
    Tipping, ME
    Bishop, CM
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 : 611 - 622