A graph regularized dimension reduction method for out-of-sample data

被引:11
|
作者
Tang, Mengfan [1 ]
Nie, Feiping [2 ,3 ]
Jain, Ramesh [1 ]
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[3] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian, Peoples R China
关键词
Dimension reduction; Out-of-sample data; Graph regularized PCA; Manifold learning; Clustering; RECOGNITION; EIGENMAPS;
D O I
10.1016/j.neucom.2016.11.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Among various dimension reduction techniques, Principal Component Analysis (PCA) is specialized in treating vector data, whereas Laplacian embedding is often employed for embedding graph data. Moreover, graph regularized PCA, a combination of both techniques, has also been developed to assist the learning of a low dimensional representation of vector data by incorporating graph data. However, these approaches are confronted by the out-of-sample problem: each time when new data is added, it has to be combined with the old data before being fed into the algorithm to re-compute the eigenvectors, leading to enormous computational cost. In order to address this problem, we extend the graph regularized PCA to the graph regularized linear regression PCA (grlrPCA). grlrPCA eliminates the redundant calculation on the old data by first learning a linear function and then directly applying it to the new data for its dimension reduction. Furthermore, we derive an efficient iterative algorithm to solve grlrPCA optimization problem and show the close relatedness of grlrPCA and unsupervised Linear Discriminant Analysis at infinite regularization parameter limit. The evaluations of multiple metrics on seven realistic datasets demonstrate that grlrPCA outperforms established unsupervised dimension reduction algorithms.
引用
收藏
页码:58 / 63
页数:6
相关论文
共 50 条
  • [1] Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees
    Osama, Muhammad
    Zachariah, Dave
    Stoica, Peter
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [2] Out-of-sample extension of graph adjacency spectral embedding
    Levin, Keith
    Roosta-Khorasani, Farbod
    Mahoney, Michael W.
    Priebe, Carey E.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [3] DATA REVISIONS AND OUT-OF-SAMPLE STOCK RETURN PREDICTABILITY
    Guo, Hui
    ECONOMIC INQUIRY, 2009, 47 (01) : 81 - 97
  • [4] Image classification with manifold learning for out-of-sample data
    Han, Yahong
    Xu, Zhongwen
    Ma, Zhigang
    Huang, Zi
    SIGNAL PROCESSING, 2013, 93 (08) : 2169 - 2177
  • [5] LEARNING GENERAL TRANSFORMATIONS OF DATA FOR OUT-OF-SAMPLE EXTENSIONS
    Amodio, Matthew
    van Dijk, David
    Wolf, Guy
    Krishnaswamy, Smita
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [6] Classification of small lesions on dynamic breast MRI: Integrating dimension reduction and out-of-sample extension into CADx methodology
    Nagarajan, Mahesh B.
    Huber, Markus B.
    Schlossbauer, Thomas
    Leinsinger, Gerda
    Krol, Andrzej
    Wismueller, Axel
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2014, 60 (01) : 65 - 77
  • [7] Online landmark replacement for out-of-sample dimensionality reduction methods
    Thongprayoon, Chanon
    Masuda, Naoki
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2024, 480 (2300):
  • [8] Out-of-Sample Extension for Dimensionality Reduction of Noisy Time Series
    Dadkhahi, Hamid
    Duarte, Marco F.
    Marlin, Benjamin M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) : 5435 - 5446
  • [9] Robust out-of-sample inference
    McCracken, MW
    JOURNAL OF ECONOMETRICS, 2000, 99 (02) : 195 - 223
  • [10] ISOMAP OUT-OF-SAMPLE EXTENSION FOR NOISY TIME SERIES DATA
    Dadkhahi, Hamid
    Duarte, Marco F.
    Marlin, Benjamin
    2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,