A GA-based Feature Selection for High-dimensional Data Clustering

被引:5
|
作者
Sun, Mei [1 ]
Xiong, Langhuan [1 ]
Sun, Haojun [1 ]
Jiang, Dazhi [1 ]
机构
[1] Shantou Univ, Dept Comp Sci & Technol, Shantou 515063, Peoples R China
关键词
feature selection; clustering; genetic algorithms; high-dimensional data;
D O I
10.1109/WGEC.2009.140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-dimensional data clustering is an open problem in modern data mining. This paper proposed a new genetic algorithm-based feature selection for high-dimensional data clustering, called GA-FSFclustering. This approach searches effective feature subsets for clustering in all features by genetic algorithm. The candidate features and cluster centers are real number encoded. A new criterion for evaluating feature subsets is employed as the fitness function. The experimental results indicate the feasibility and efficiency of the GA-FSFclustering algorithm.
引用
收藏
页码:769 / 772
页数:4
相关论文
共 50 条
  • [1] Boosting the Convergence of a GA-based Wrapper for Feature Selection Problems on High-dimensional Data
    Carlos Gomez-Lopez, Juan
    Jose Escobar, Juan
    Francisco Diaz, Antonio
    Damas, Miguel
    Gil-Montoya, Francisco
    Gonzalez, Jesus
    [J]. PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 431 - 434
  • [2] A GA-BASED FEATURE SELECTION AND ENSEMBLE LEARNING FOR HIGH-DIMENSIONAL DATASETS
    Xia, Pei-Yong
    Ding, Xiang-Qian
    Jiang, Bai-Ning
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 7 - +
  • [3] A Novel GA-based Feature Selection Approach for High Dimensional Data
    De Stefano, Claudio
    Fontanella, Francesco
    di Freca, Alessandra Scotto
    [J]. PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'16 COMPANION), 2016, : 87 - 88
  • [4] Clustering high-dimensional data via feature selection
    Liu, Tianqi
    Lu, Yu
    Zhu, Biqing
    Zhao, Hongyu
    [J]. BIOMETRICS, 2023, 79 (02) : 940 - 950
  • [5] A density-based clustering algorithm for high-dimensional data with feature selection
    Qi Xianting
    Wang Pan
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 114 - 118
  • [6] Differential Privacy High-Dimensional Data Publishing Based on Feature Selection and Clustering
    Chu, Zhiguang
    He, Jingsha
    Zhang, Xiaolei
    Zhang, Xing
    Zhu, Nafei
    [J]. ELECTRONICS, 2023, 12 (09)
  • [7] On online high-dimensional spherical data clustering and feature selection
    Amayri, Ola
    Bouguila, Nizar
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (04) : 1386 - 1398
  • [8] A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data
    Song, Qinbao
    Ni, Jingjie
    Wang, Guangtao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (01) : 1 - 14
  • [9] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [10] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75