Nonlinear principal component analysis of noisy data

Cited by: 32
Authors
Hsieh, William W. [1]
Affiliations
[1] Univ British Columbia, Dept Earth & Ocean Sci, Vancouver, BC V6T 1Z4, Canada
Keywords
nonlinear principal component analysis; information criterion; model selection; autoassociative neural network; regularization; El Niño; ENSO; quasibiennial oscillation
DOI
10.1016/j.neunet.2007.04.018
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With very noisy data, having plentiful samples eliminates overfitting in nonlinear regression, but not in nonlinear principal component analysis (NLPCA). To overcome this problem in NLPCA, a new information criterion (IC) is proposed for selecting the best model among multiple models with different complexity and regularization (i.e. weight penalty). This IC gauges the inconsistency I between the nonlinear principal components (u and ũ) for every data point x and its nearest neighbour x̃, with I = 1 - correlation(u, ũ), where I tends to increase with overfitted solutions. Tests were performed using autoassociative neural networks for NLPCA on synthetic and real climate data (tropical Pacific sea surface temperatures and equatorial stratospheric winds), with the IC performing well in model selection and in deciding between an open curve or a closed curve solution. © 2007 Elsevier Ltd. All rights reserved.
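
The inconsistency index described in the abstract can be illustrated with a short Python sketch. It is not the author's implementation: the function name `inconsistency_index`, the use of Euclidean distance to find nearest neighbours, and the brute-force neighbour search are all assumptions made for illustration. The abstract does not state how I enters the full information criterion, so only I itself is computed here.

```python
import numpy as np

def inconsistency_index(X, u):
    """Return I = 1 - correlation(u, u_tilde).

    X : (n, d) array of data points.
    u : (n,) array, the nonlinear principal component of each point.
    u_tilde holds, for each point, the component value of its nearest
    neighbour (Euclidean distance is an assumption of this sketch).
    """
    X = np.asarray(X, dtype=float)
    u = np.asarray(u, dtype=float)
    # Pairwise squared distances, O(n^2) memory: fine for small n only.
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.einsum("ijk,ijk->ij", diff, diff)
    np.fill_diagonal(d2, np.inf)      # a point is not its own neighbour
    nearest = np.argmin(d2, axis=1)   # index of each point's nearest neighbour
    u_tilde = u[nearest]
    return 1.0 - np.corrcoef(u, u_tilde)[0, 1]

# Synthetic check: noisy points scattered around a straight line.
rng = np.random.default_rng(0)
t = rng.uniform(-1.0, 1.0, size=200)
X = np.column_stack([t, t]) + 0.05 * rng.standard_normal((200, 2))

u_smooth = X[:, 0] + X[:, 1]          # varies smoothly across the data space
u_random = rng.standard_normal(200)   # unrelated to position in data space

I_smooth = inconsistency_index(X, u_smooth)
I_random = inconsistency_index(X, u_random)
print(I_smooth, I_random)
```

A component that varies smoothly across the data space gives neighbouring points similar values, so I stays close to 0. A component that assigns unrelated values to neighbouring points, as an overfitted solution tends to do, pushes I towards 1.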
Pages: 434-443
Number of pages: 10