Nonlinear principal component analysis of noisy data

被引:0
|
作者
Hsieh, William W. [1 ]
机构
[1] Univ British Columbia, Dept Earth & Ocean Sci, Vancouver, BC V6T 1Z4, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With very noisy data, overfitting is a serious problem in pattern recognition. For nonlinear regression, having plentiful data eliminates overfitting, but for nonlinear principal component analysis (NLPCA), overfitting persists even with plentiful data. Thus simply minimizing mean square error is not a sufficient criterion for NLPCA to find good solutions in noisy data. A new index is proposed which measures the disparity between the nonlinear principal components u and (u) over tilde for a data point x and its nearest neighbour (x) over tilde. This index, 1 - C-S (the Spearman rank correlation between u and (u) over tilde), tends to increase with overfitted solutions, thereby providing a diagnostic tool to determine how much regularization (i.e. weight penalty) should be used in the objective function of the NLPCA to prevent overfitting. Tests are performed using autoassociative neural networks for NLPCA on synthetic and real climate data.
引用
收藏
页码:4582 / 4586
页数:5
相关论文
共 50 条
  • [1] Nonlinear principal component analysis of noisy data
    Hsieh, William W.
    [J]. NEURAL NETWORKS, 2007, 20 (04) : 434 - 443
  • [2] Principal Component Analysis with Noisy and/or Missing Data
    Bailey, Stephen
    [J]. PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF THE PACIFIC, 2012, 124 (919) : 1015 - 1023
  • [3] IMPROVED PRINCIPAL COMPONENT ANALYSIS OF NOISY DATA
    FAY, MJ
    PROCTOR, A
    HOFFMANN, DP
    HERCULES, DM
    [J]. ANALYTICAL CHEMISTRY, 1991, 63 (11) : 1058 - 1063
  • [4] A nonlinear principal component analysis on image data
    Saegusa, R
    Sakano, H
    Hashimoto, S
    [J]. MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 589 - 598
  • [5] A Nonlinear principal component analysis of image data
    Saegusa, R
    Sakano, H
    Hashimoto, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (10): : 2242 - 2248
  • [6] Nonlinear and additive principal component analysis for functional data
    Song, Jun
    Li, Bing
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 181
  • [7] A nonlinear principal component analysis to study archeometric data
    Bitetto, Alessandro
    Mangone, Annarosa
    Mininni, Rosa Maria
    Giannossa, Lorena C.
    [J]. JOURNAL OF CHEMOMETRICS, 2016, 30 (07) : 405 - 415
  • [8] Streaming Principal Component Analysis in Noisy Settings
    Marinov, Teodor, V
    Mianjy, Poorya
    Arora, Raman
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [9] SAR Data Fusion Using Nonlinear Principal Component Analysis
    Fasano, Luca
    Latini, Daniele
    Machidon, Alina
    Clementini, Chiara
    Schiavon, Giovanni
    Del Frate, Fabio
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (09) : 1543 - 1547
  • [10] Deep Contrastive Principal Component Analysis Adaptive to Nonlinear Data
    Cao, Hongjie
    Wang, Gang
    Sun, Jian
    Deng, Fang
    Chen, Jie
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 5738 - 5750