Evolutionary feature selection for imbalanced data

被引:2
|
作者
Tusell Rey, Claudia C. [1 ]
Salinas Garcia, Viridiana [2 ]
Villuendas-Rey, Yenny [2 ]
机构
[1] Inst Politecn Nacl, Ctr Invest Comp, Mexico City, DF, Mexico
[2] Inst Politecn Nacl, Ctr Innovac & Desarrollo Tecnol Comp, Mexico City, DF, Mexico
关键词
evolutionary algorithms; feature selection; imbalanced data; nearest neighbor;
D O I
10.1109/ENC60556.2023.10508674
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data imbalance and high dimensionality are two of the biggest changes in machine learning. To address such issues, feature selection is one of the Data Mining techniques that has become the focus of many research works. This work introduces a novel evolutionary hybrid feature selection algorithm based on filter statistics calculation. We conducted an experimental study over 44 imbalanced datasets, to evaluate the performance of the proposed evolutionary feature selection algorithm by considering seven filter statistics in the classification of imbalanced data. As a result, we offer a ranking of the filter statistics that better suit the hybrid evolutionary approach proposed for feature selection of imbalanced data, thus contributing to improving the Nearest Neighbor classifiers.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Evolutionary multistage multitasking method for feature selection in imbalanced data
    Ding, Weiping
    Yao, Hongcheng
    Huang, Jiashuang
    Hou, Tao
    Geng, Yu
    [J]. Swarm and Evolutionary Computation, 2025, 92
  • [2] Feature Selection in Imbalanced Data
    Kamalov F.
    Thabtah F.
    Leung H.H.
    [J]. Annals of Data Science, 2023, 10 (06) : 1527 - 1541
  • [3] Univariate feature selection on imbalanced data
    Chatterjee, Avishek
    Woodruff, Henry
    Lobbes, Marc
    Vallieres, Martin
    Seuntjens, Jan
    [J]. MEDICAL PHYSICS, 2019, 46 (11) : 5375 - 5375
  • [4] Causal Feature Selection With Imbalanced Data
    Ling, Zhaolong
    Wu, Jingxuan
    Zhang, Yiwen
    Zhou, Peng
    Yu, Kui
    Jiang, Bingbing
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [5] Evolutionary feature subsets selection based on interaction information for high dimensional imbalanced data classification
    Hosseini, Elahe Sadat
    Moattar, Mohammad Hossein
    [J]. APPLIED SOFT COMPUTING, 2019, 82
  • [6] An Embedded Feature Selection Method for Imbalanced Data Classification
    Liu, Haoyue
    Zhou, MengChu
    Liu, Qing
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (03) : 703 - 715
  • [7] Feature Selection with Imbalanced Data for Software Defect Prediction
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 235 - +
  • [8] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    [J]. NEUROCOMPUTING, 2013, 105 : 3 - 11
  • [9] An Embedded Feature Selection Method for Imbalanced Data Classification
    Haoyue Liu
    MengChu Zhou
    Qing Liu
    [J]. IEEE/CAA Journal of Automatica Sinica, 2019, 6 (03) : 703 - 715
  • [10] A Classification Method Based on Feature Selection for Imbalanced Data
    Liu, Yi
    Wang, Yanzhen
    Ren, Xiaoguang
    Zhou, Hao
    Diao, Xingchun
    [J]. IEEE ACCESS, 2019, 7 : 81794 - 81807