A Principal Component Analysis Algorithm Based on Dimension Reduction Window

被引:8
|
作者
Zhang, Rui
Du, Tao [1 ]
Qu, Shouning
机构
[1] Univ Jinan, Sch Informat Sci & Engn, Jinan 250022, Shandong, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Dimensionality reduction; correlation coefficient matrix; principal component analysis; dimension reduction window; OPTIMIZATION ALGORITHM; FEATURE-EXTRACTION; ROBUST PCA; KPCA; CLASSIFICATION;
D O I
10.1109/ACCESS.2018.2875270
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dimensionality reduction is an essential preprocessing step for data mining. Principal component analysis (PCA) is the most classical method of reducing dimension and a variety of methods based on it are extended. However, all these methods require at least one transposition and quadrature operation of the original high-dimensional matrix and the dimension reduction results loss the meaning of the original data, it will inevitably bring difficulties for people to the further analysis of classification or clustering results. We develop a novel algorithm named DRWPCA in this paper, it does not need to map the original data to the space of other dimensions for processing, but realizes the dimension reduction by analyzing the correlation between the dimensions, and therefore the physical meaning of the original data set is retained. It utilizes mathematical statistics to obtain the correlation coefficient or the degree of correlation between attributes. By statistical analysis of the degree of correlation between attributes, the feature with high correlation is removed so as to achieve the goal of reducing the dimension. DRWPCA is inspired by the content of the correlation coefficient part of the digital feature of a random variable, and the sliding window model for traffic control in network engineering. Experimental result demonstrates that the DRWPCA provides promising accuracy, higher ability to reduce dimension and preserves the original information of the data.
引用
收藏
页码:63737 / 63747
页数:11
相关论文
共 50 条
  • [1] Dimension reduction by local principal component analysis
    Kambhatla, N
    Leen, TK
    [J]. NEURAL COMPUTATION, 1997, 9 (07) : 1493 - 1516
  • [2] Dimension reduction in principal component analysis for trees
    Alfaro, Carlos A.
    Aydin, Burcu
    Valencia, Carlos E.
    Bullitt, Elizabeth
    Ladha, Alim
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 74 : 157 - 179
  • [3] Study of Defect Feature Dimension Reduction Based on Principal Component Analysis
    Han Fangfang
    Zhu Junchao
    Zhang Baofeng
    Duan Fajie
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1367 - 1371
  • [4] Effect of dimension reduction by principal component analysis on clustering
    Erisoglu, Murat
    Erisoglu, Ulku
    [J]. JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2011, 14 (02) : 277 - 287
  • [5] High-dimensional Data Classification Based on Principal Component Analysis Dimension Reduction and Improved BP Algorithm
    Yan, Tai-shan
    Wen, Yi-ting
    Li, Wen-bin
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORK AND ARTIFICIAL INTELLIGENCE (CNAI 2018), 2018, : 441 - 445
  • [6] Dimension reduction in radio maps based on the supervised kernel principal component analysis
    Bing Jia
    Baoqi Huang
    Hepeng Gao
    Wuyungerile Li
    [J]. Soft Computing, 2018, 22 : 7697 - 7703
  • [7] Dimension reduction in radio maps based on the supervised kernel principal component analysis
    Jia, Bing
    Huang, Baoqi
    Gao, Hepeng
    Li, Wuyungerile
    [J]. SOFT COMPUTING, 2018, 22 (23) : 7697 - 7703
  • [8] Dimension selection for feature selection and dimension reduction with principal and independent component analysis
    Koch, Inge
    Naito, Kanta
    [J]. NEURAL COMPUTATION, 2007, 19 (02) : 513 - 545
  • [9] Reduction of the multivariate input dimension using principal component analysis
    Xi, Jianhui
    Han, Min
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 1047 - 1051
  • [10] A Geometric Algorithm for Contrastive Principal Component Analysis in High Dimension
    Lu, Rung-Sheng
    Wang, Shao-Hsuan
    Huang, Su-Yun
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (03) : 909 - 916