A novel hybrid genetic algorithm with granular information for feature selection and optimization

Cited by: 131
Authors
Dong, Hongbin [1]
Li, Tao [1]
Ding, Rui [1]
Sun, Jing [1]
Affiliations
[1] Harbin Engineering University, College of Computer Science and Technology, Harbin, Heilongjiang, People's Republic of China
Funding
National Science Foundation (USA)
Keywords
Feature selection; Granular computing; Genetic algorithm; Rough set; Parameter optimization; Particle swarm optimization; Classification; Relevance
DOI
10.1016/j.asoc.2017.12.048
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection is an important task in data mining and pattern recognition. It aims to choose the optimal feature subset, one with minimum redundancy and maximum discriminating ability. This paper analyzes feature selection from two perspectives: the data and the algorithm. To deal with redundant and irrelevant features in both high-dimensional, low-sample data and low-dimensional, high-sample data, it presents a feature selection model based on granular information. The research examines experimentally how the granularity level affects both the classification accuracy and the size of the selected feature subset. First, an improved binary genetic algorithm with feature granulation (IBGAFG) is used to select the significant features. Then, an improved neighborhood rough set with sample granulation (INRSG) is proposed under different granular radii, which further improves the quality of the feature subset. Finally, to find the optimal granular radius, granularity lambda optimization based on a genetic algorithm (ROGA) is presented; the optimal granularity parameters are found adaptively from the feedback of classification accuracy. The performance of the proposed algorithms is tested on eleven publicly available data sets and compared with other supervisory methods or evolutionary algorithms. Additionally, the ROGA algorithm is applied to an enterprise financial data set to select the features that affect financial status. Experimental results demonstrate that the approaches are efficient and can provide higher classification accuracy using granular information. (c) 2018 Elsevier B.V. All rights reserved.
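To make the role of the granular radius concrete, here is a minimal Python sketch of the classical neighborhood rough set dependency measure: the fraction of samples whose radius-neighborhood, computed on a chosen feature subset, contains only samples of the same class. This is not the paper's INRSG, whose details the abstract does not give. The function name `neighborhood_dependency`, the Euclidean distance, and the toy data are all assumptions made for illustration.

```python
from math import dist  # Euclidean distance, Python 3.8+


def neighborhood_dependency(X, y, features, radius):
    """Fraction of samples whose neighborhood is class-pure.

    X: list of numeric rows, y: class labels, features: column indices
    to keep, radius: granular radius (hypothetical parameter name).
    """
    if not features:
        return 0.0  # assumption: an empty subset discriminates nothing
    # Project every sample onto the selected features.
    proj = [[row[j] for j in features] for row in X]
    pure = 0
    for i, p in enumerate(proj):
        # A sample is in the positive region if every sample within
        # `radius` of it (itself included) shares its class label.
        if all(y[k] == y[i] for k, q in enumerate(proj) if dist(p, q) <= radius):
            pure += 1
    return pure / len(X)


# Toy data (assumed): feature 0 separates the classes, feature 1 does not.
X = [[0.0, 0.5], [0.1, 0.9], [0.9, 0.4], [1.0, 0.8]]
y = [0, 0, 1, 1]

for r in (0.2, 0.85):
    print(r, neighborhood_dependency(X, y, [0], r),
          neighborhood_dependency(X, y, [1], r))
```

A larger radius gives coarser granules, so fewer samples stay class-pure. A search over the radius, such as the genetic search the abstract describes, trades this off against classification accuracy.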
Pages: 33-46
Page count: 14
Related papers
50 in total
  • [1] A novel feature selection approach by hybrid genetic algorithm
    Huang, Jinjie
    Lv, Ning
    Li, Wenlong
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 721 - 729
  • [2] A Novel Hybrid Algorithm for Feature Selection Based on Whale Optimization Algorithm
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. IEEE ACCESS, 2019, 7 : 14908 - 14923
  • [3] A Hybrid Feature Selection Method Based on Genetic Algorithm and Information Gain
    He, Fei
    Yang, Huamin
    Miao, Yu
    Louis, Rainbow
    [J]. PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 320 - 323
  • [4] A hybrid genetic algorithm for feature selection wrapper based on mutual information
    Huang, Jinjie
    Cai, Yunze
    Xu, Xiaoming
    [J]. PATTERN RECOGNITION LETTERS, 2007, 28 (13) : 1825 - 1844
  • [5] Simultaneous Feature Selection Optimization Based on Hybrid Sooty Tern Optimization Algorithm and Genetic Algorithm
    Jia, He-Ming
    Li, Yao
    Sun, Kang-Jian
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (06) : 1601 - 1615
  • [6] A novel hybrid algorithm for feature selection
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. PERSONAL AND UBIQUITOUS COMPUTING, 2018, 22 (5-6) : 971 - 985
  • [7] A novel hybrid algorithm for feature selection
    Yuefeng Zheng
    Ying Li
    Gang Wang
    Yupeng Chen
    Qian Xu
    Jiahao Fan
    Xueting Cui
    [J]. Personal and Ubiquitous Computing, 2018, 22 : 971 - 985
  • [8] A Novel Hybrid Genetic Algorithm and Simulated Annealing for Feature Selection and Kernel Optimization in Support Vector Regression
    Wu, Jiansheng
    Lu, Zusong
    Jin, Long
    [J]. 2012 IEEE 13TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2012, : 401 - 406
  • [9] A Novel Hybrid Genetic Algorithm and Simulated Annealing for Feature Selection and Kernel Optimization in Support Vector Regression
    Wu, Jiansheng
    Lu, Zusong
    [J]. 2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 999 - 1003
  • [10] Feature selection in SVM based on the hybrid of enhanced genetic algorithm and mutual information
    Zhang, Chunkai
    Hu, Hong
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, 2006, 3885 : 307 - 316