A Clustering Based Genetic Algorithm for Feature Selection

被引:0
|
作者
Rostami, Mehrdad [1 ]
Moradi, Parham [1 ]
机构
[1] Univ Kurdistan, Dept Comp Engn, Sanandaj, Iran
关键词
component; feature selection; genetic algorithm; feature clustering; ANT COLONY OPTIMIZATION; FEATURE SUBSET-SELECTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Feature selection is a fundamental data preprocessing step in data mining, where its goal is removing some irrelevant and/or redundant features from a given dataset. In this paper, we present a clustering based genetic algorithm for feature selection (CGAFS). The proposed algorithm works in three steps. In the first step, Subset size is determined. In the second step, features are divided into clusters using k-means clustering algorithm. Finally, in the third step, features are selected using genetic algorithm with a new clustering based repair operation. The performance of the proposed method has been assessed on five benchmark classification problems. We also compared the performance of CGAFS with the results obtained from four existing well-known feature selection algorithms. The results show that the CGAFS produces consistently better classification accuracies.
引用
收藏
页码:112 / 116
页数:5
相关论文
共 50 条
  • [1] A feature selection Bayesian approach for a clustering genetic algorithm
    Hruschka, ER
    Hruschka, ER
    Ebecken, NFF
    [J]. DATA MINING IV, 2004, 7 : 181 - 192
  • [2] Sampling and feature selection in a genetic algorithm for document clustering
    Casillas, A
    de Lena, MTG
    Martínez, R
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2004, 2945 : 601 - 612
  • [3] Unsupervised Feature Selection Technique Based on Genetic Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [4] A fuzzy clustering based algorithm for feature selection
    Sun, HJ
    Wang, SR
    Mei, Z
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1993 - 1998
  • [5] Balanced Spectral Clustering Algorithm Based on Feature Selection
    Luo, Qimin
    Lu, Guangquan
    Wen, Guoqiu
    Su, Zidong
    Liu, Xingyi
    Wei, Jian
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT II, 2022, 13088 : 356 - 367
  • [6] A novel feature selection approach based on clustering algorithm
    Moslehi, Fateme
    Haeri, Abdorrahman
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (03) : 581 - 604
  • [7] FCFilter: Feature Selection based on Clustering and Genetic Algorithms
    Ferreira, Charles H. P.
    de Medeiros, Debora M. R.
    Santana, Fabiana
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 2106 - 2113
  • [8] Deluge based Genetic Algorithm for feature selection
    Guha, Ritam
    Ghosh, Manosij
    Kapri, Souvik
    Shaw, Sushant
    Mutsuddi, Shyok
    Bhateja, Vikrant
    Sarkar, Ram
    [J]. EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 357 - 367
  • [9] Feature subset selection based on the genetic algorithm
    Yang, Jingwei
    Wang, Sile
    Chen, Yingyi
    Lu, Sukui
    Yang, Wenzhu
    [J]. ADVANCED TECHNOLOGIES IN MANUFACTURING, ENGINEERING AND MATERIALS, PTS 1-3, 2013, 774-776 : 1532 - +
  • [10] Deluge based Genetic Algorithm for feature selection
    Ritam Guha
    Manosij Ghosh
    Souvik Kapri
    Sushant Shaw
    Shyok Mutsuddi
    Vikrant Bhateja
    Ram Sarkar
    [J]. Evolutionary Intelligence, 2021, 14 : 357 - 367