Unsupervised Feature Selection and Clustering Optimization Based on Improved Differential Evolution

被引:4
|
作者
Li, Tao [1 ]
Dong, Hongbin [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
美国国家科学基金会;
关键词
Feature extraction; Clustering algorithms; Sociology; Statistics; Optimization; Task analysis; Manifolds; Feature selection; evolutionary clustering; differential evolution; optimization algorithm; ALGORITHM;
D O I
10.1109/ACCESS.2019.2937739
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The feature selection method based on supervised learning has been widely studied and applied to the field of machine learning and data mining. But unsupervised feature selection is still a tricky area of research because the unavailability of the label information, especially for clustering tasks. Irrelevant features and redundant features in the original data seriously block the discovery of clustering structure and weaken the performance of the subsequent classification. In order to address this problem, the unsupervised feature selection and clustering algorithm based on the evolutionary computing framework is proposed in this paper. First, the binary differential evolution algorithm is constructed for unsupervised feature selection. Specifically, the individuals of the population are used to characterize the feature subspaces and the improved Laplacian model is designed to measure the local manifold structure of each individual. Subsequently, the approximate optimal manifold structure and the corresponding feature subset are obtained. Then, the continuous differential evolutionary algorithm is executed on the optimized feature subset, in which the individual representation strategy and the integrated individual measure function are designed for clustering. Moreover, the predicted pseudo-labels are utilized to classify and further verify the validity of clustering. The experimental results demonstrate that the proposed framework outperforms the most state-of-the-art methods.
引用
收藏
页码:140438 / 140450
页数:13
相关论文
共 50 条
  • [41] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Shamsinejadbabki, Pirooz
    Saraee, Mohammad
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2012, 38 (03) : 669 - 684
  • [42] Unsupervised feature selection via discrete spectral clustering and feature weights
    Shang, Ronghua
    Kong, Jiarui
    Wang, Lujuan
    Zhang, Weitong
    Wang, Chao
    Li, Yangyang
    Jiao, Licheng
    [J]. NEUROCOMPUTING, 2023, 517 : 106 - 117
  • [43] Integration of dense subgraph finding with feature clustering for unsupervised feature selection
    Bandyopadhyay, Sanghamitra
    Bhadra, Tapas
    Mitra, Pabitra
    Maulik, Ujjwal
    [J]. PATTERN RECOGNITION LETTERS, 2014, 40 : 104 - 112
  • [44] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Pirooz Shamsinejadbabki
    Mohammad Saraee
    [J]. Journal of Intelligent Information Systems, 2012, 38 : 669 - 684
  • [45] Graph-based unsupervised feature selection and multiview clustering for microarray data
    Tripti Swarnkar
    Pabitra Mitra
    [J]. Journal of Biosciences, 2015, 40 : 755 - 767
  • [46] Unsupervised Feature Selection Technique Based on Genetic Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [47] UNSUPERVISED FEATURE SELECTION BASED ON FEATURE RELEVANCE
    Zhang, Feng
    Zhao, Ya-Jun
    Chen, Jun-Fen
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 487 - +
  • [48] Unsupervised Feature Selection with Structured Graph Optimization
    Nie, Feiping
    Zhu, Wei
    Li, Xuelong
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1302 - 1308
  • [49] Structured Graph Optimization for Unsupervised Feature Selection
    Nie, Feiping
    Zhu, Wei
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (03) : 1210 - 1222
  • [50] Differential evolution based on network structure for feature selection
    Hu, Yanmei
    Lu, Min
    Li, Xiangtao
    Cai, Biao
    [J]. INFORMATION SCIENCES, 2023, 635 : 279 - 297