Principal Component Analysis based Feature Selection for clustering

被引:6
|
作者
Xu, Jun-Ling [1 ]
Xu, Bao-Wen [1 ,2 ]
Zhang, Wei-Feng [3 ]
Cui, Zi-Feng [1 ]
机构
[1] Southeast Univ, Sch Engn & Comp Sci, Nanjing 211189, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Dept Comp, Nanjing 210003, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Dept Comp, Nanjing 210003, Peoples R China
基金
中国国家自然科学基金;
关键词
feature selection; Principal Component Analysis; clustering;
D O I
10.1109/ICMLC.2008.4620449
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Feature Extraction (FE) methods have been proved to be very effective for dimension reduction, but the features attained are meaningless. In order to exploit the effectiveness of FE methods to support Feature Selection (FS), this paper proposed a new FS approach for clustering based on Principal Component Analysis (PCA) called PS. It first uses PCA to transform the data from original feature space into a new feature space whose features are linear combination of the original ones, and then evaluates the importance of the original features based on the newly generated features and the feature importance measure proposed in this paper, finally selects features incrementally according to their importance to improve the performance of the clustering algorithm. Experiment is carried out on several popular data sets and the results show the advantages of the proposed approach.
引用
收藏
页码:460 / +
页数:2
相关论文
共 50 条
  • [1] Clustering and feature selection using sparse principal component analysis
    Ronny Luss
    Alexandre d’Aspremont
    Optimization and Engineering, 2010, 11 : 145 - 157
  • [2] Clustering and feature selection using sparse principal component analysis
    Luss, Ronny
    d'Aspremont, Alexandre
    OPTIMIZATION AND ENGINEERING, 2010, 11 (01) : 145 - 157
  • [3] Feature selection based on improved principal component analysis
    Li, Zhangyu
    Qiu, Yihui
    2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 188 - 192
  • [4] COMBINING FEATURE SELECTION WITH EXTRACTION: UNSUPERVISED FEATURE SELECTION BASED ON PRINCIPAL COMPONENT ANALYSIS
    Li, Yun
    Lu, Bao-Liang
    Zhang, Teng-Fei
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2009, 18 (06) : 883 - 904
  • [5] Feature selection of stabilometric parameters based on principal component analysis
    L. Rocchi
    L. Chiari
    A. Cappello
    Medical and Biological Engineering and Computing, 2004, 42 : 71 - 79
  • [6] Feature selection of stabilometric parameters based on principal component analysis
    Rocchi, L
    Chiari, L
    Cappello, A
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2004, 42 (01) : 71 - 79
  • [7] Supervised feature selection using principal component analysis
    Rahmat, Fariq
    Zulkafli, Zed
    Ishak, Asnor Juraiza
    Rahman, Ribhan Zafira Abdul
    De Stercke, Simon
    Buytaert, Wouter
    Tahir, Wardah
    Ab Rahman, Jamalludin
    Ibrahim, Salwa
    Ismail, Muhamad
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (03) : 1955 - 1995
  • [8] Supervised feature selection using principal component analysis
    Fariq Rahmat
    Zed Zulkafli
    Asnor Juraiza Ishak
    Ribhan Zafira Abdul Rahman
    Simon De Stercke
    Wouter Buytaert
    Wardah Tahir
    Jamalludin Ab Rahman
    Salwa Ibrahim
    Muhamad Ismail
    Knowledge and Information Systems, 2024, 66 : 1955 - 1995
  • [9] Feature selection in principal component analysis of analytical data
    Guo, Q
    Wu, W
    Massart, DL
    Boucon, C
    de Jong, S
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 61 (1-2) : 123 - 132
  • [10] Feature selection using principal component analysis and genetic algorithm
    Adhao, Rahul
    Pachghare, Vinod
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (02): : 595 - 602