Feature Selection for Multi-Class Imbalanced Data Sets Based on Genetic Algorithm

被引:1
|
作者
Du L.-M. [1 ,2 ]
Xu Y. [1 ]
Zhu H. [1 ]
机构
[1] Intelligent Control Development Center, Southwest Jiaotong University, Chengdu
[2] Pharmacy College of Henan University, Kaifeng
基金
中国国家自然科学基金;
关键词
Feature selection; Genetic algorithm; Multi-class imbalanced data sets; Support vector machine;
D O I
10.1007/s40745-015-0060-x
中图分类号
学科分类号
摘要
This paper presents an improved genetic algorithm based feature selection method for multi-class imbalanced data. This method improves the fitness function through using the evaluation criterion EG-mean instead of the global classification accuracy in order to choose the features which are favorable to recognize the minor classes. The method is evaluated using several benchmark data sets, and the experimental results show that, compared with the traditional feature selection method based on genetic algorithm, the proposed method has certain advantages in the size of feature subsets and improves the precision of the minor classes for multi-class imbalanced data sets. © 2015, Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:293 / 300
页数:7
相关论文
共 50 条
  • [21] Evaluating Difficulty of Multi-class Imbalanced Data
    Lango, Mateusz
    Napierala, Krystyna
    Stefanowski, Jerzy
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 312 - 322
  • [22] FEATURE SELECTION FOR IMBALANCED DATASETS BASED ON IMPROVED GENETIC ALGORITHM
    Du, Limin
    Xu, Yang
    Jin, Liuqian
    [J]. DECISION MAKING AND SOFT COMPUTING, 2014, 9 : 119 - 124
  • [23] Survey on Highly Imbalanced Multi-class Data
    Hamid, Hakim Abdul
    Yusoff, Marina
    Mohamed, Azlinah
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 211 - 229
  • [24] Dynamic ensemble selection for multi-class imbalanced datasets
    Garcia, Salvador
    Zhang, Zhong-Liang
    Altalhi, Abdulrahman
    Alshomrani, Saleh
    Herrera, Francisco
    [J]. INFORMATION SCIENCES, 2018, 445 : 22 - 37
  • [25] A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
    Quan, Daying
    Feng, Wei
    Dauphin, Gabriel
    Wang, Xiaofeng
    Huang, Wenjiang
    Xing, Mengdao
    [J]. REMOTE SENSING, 2022, 14 (15)
  • [26] Evaluating classifier predictive performance in multi-class problems with balanced and imbalanced data sets
    Vanacore, Amalia
    Pellegrino, Maria Sole
    Ciardiello, Armando
    [J]. QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2023, 39 (02) : 651 - 669
  • [27] Combined Cleaning and Resampling algorithm for multi-class imbalanced data with label noise
    Koziarski, Michal
    Wozniak, Michal
    Krawczyk, Bartosz
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 204
  • [28] Multi-class target recognition based on adaptive feature selection
    Wang, Yuehuan
    Yao, Wei
    Song, Yunfeng
    Sang, Nong
    Zhang, Tianxu
    [J]. AUTOMATIC TARGET RECOGNITION XX; ACQUISITION, TRACKING, POINTING, AND LASER SYSTEMS TECHNOLOGIES XXIV; AND OPTICAL PATTERN RECOGNITION XXI, 2010, 7696
  • [29] An oversampling method for multi-class imbalanced data based on composite weights
    Deng, Mingyang
    Guo, Yingshi
    Wang, Chang
    Wu, Fuwei
    [J]. PLOS ONE, 2021, 16 (11):
  • [30] Multi-Class Joint Rule Extraction and Feature Selection for Biological Data
    Liu, Sheng
    Patel, Ronak Y.
    Daga, Pankaj R.
    Liu, Haining
    Fu, Gang
    Doerksen, Robert
    Chen, Yixin
    Wilkins, Dawn
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 476 - 481