Research on Distributed Parallel Dimensionality Reduction Algorithm Based on PCA Algorithm

被引:0
|
作者
Wang, Linlin [1 ]
机构
[1] Qilu Univ Technol, Coll Comp Sci & Technol, Jinan, Shandong, Peoples R China
关键词
PCA algorithm; dimensionality reduction; distributed parallel; correlation coefficient matrix; Storm platform;
D O I
10.1109/itnec.2019.8729427
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
PCA algorithm is a typical data dimensionality reduction method, which projects high-dimensional data to a lower-dimensional space to obtain a low-dimensional data set that can maximally represent these characteristics of the original data set. The PCA algorithm can effectively achieve dimensionality reduction for high-dimensional data and is widely used in various fields. Aimed at the tedious calculation process of PCA algorithm and the time-consuming of processing massive stream data, this paper proposes a distributed parallel dimensionality reduction algorithm that called DP-PCA by improving the PCA algorithm. Based on the theory of PCA algorithm, DP-PCA algorithm includes three parts of improvement research. Firstly, the original data set is preprocessed by using the "mean" method. Secondly, the solution process of correlation coefficient matrix is improved. Thirdly, this paper designs a distributed parallel dimensionality reduction scheme for DP-PCA algorithm. In addition, this paper deploys DP-PCA algorithm on Storm platform to realize parallelization of the algorithm, and tests the DP-PCA algorithm. Experiments show that DP-PCA algorithm improves computational efficiency and reduces the dimensionality reduction time, and improves the speedup ratio.
引用
收藏
页码:1363 / 1367
页数:5
相关论文
共 50 条
  • [1] Multilevel parallel algorithm of PCA dimensionality reduction for hyperspectral image on GPU
    [J]. Fang, Min-Quan (877086820@qq.com), 1600, Northeast University (35):
  • [2] Distributed Parallel Adaptive Clustering algorithm based on Clique and high dimensionality reduction
    LinJiaQin
    [J]. 2011 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY (ICCIT), 2012, : 352 - 357
  • [3] Unsupervised Learning Dimensionality Reduction Algorithm PCA For Face Recognition
    Kumar, Vivek
    Kalitin, Denis
    Tiwari, Prayag
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 32 - +
  • [4] Image Dimensionality Reduction Based on the Intrinsic Dimension and Parallel Genetic Algorithm
    Lei, Liang
    Wang, TongQing
    Peng, Jun
    Yang, Bo
    [J]. INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2011, 5 (02) : 97 - 112
  • [5] Research of Incremental Dimensionality Reduction Based on Tensor Decomposition Algorithm
    Guo, Xin
    Xiang, Yang
    Lv, Dongdong
    Yuan, Shuhan
    Huang, Yinfei
    Zhang, Qi
    Wang, Jisheng
    Wang, Dong
    [J]. WIRELESS COMMUNICATIONS, NETWORKING AND APPLICATIONS, WCNA 2014, 2016, 348 : 87 - 94
  • [6] Research on Distributed Heterogeneous Data PCA Algorithm Based on Cloud Platform
    Zhang, Jin
    Huang, Gang
    [J]. 6TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION (CDMMS 2018), 2018, 1967
  • [7] A Parallel AES Encryption Algorithm Based on PCA
    Das, Debasis
    Misra, Rajiv
    [J]. ADVANCES IN PARALLEL, DISTRIBUTED COMPUTING, 2011, 203 : 238 - 246
  • [8] Research of Distributed Algorithm based on Parallel Computer Cluster System
    Xu He-li
    Liu Yan
    [J]. THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 369 - 372
  • [9] Research on parallel algorithm based on hadoop distributed computing platform
    Heilongjiang University of Technology, Jixi, China
    [J]. Int. J. Grid Distrib. Comput., 4 (163-170):
  • [10] Research on Distributed Parallel Eclat Optimization Algorithm
    Huang Qiufeng
    Li Qiang
    Huang Shiya
    Chen Yingcong
    [J]. 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), 2020, : 149 - 154