Denying Evolution Resampling: An Improved Method for Feature Selection on Imbalanced Data

被引:1
|
作者
Quan, Li [1 ]
Gong, Tao [1 ]
Jiang, Kaida [1 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
基金
中国国家自然科学基金;
关键词
classification algorithms; imbalanced data; similarity measure; evolutionary process;
D O I
10.3390/electronics12153212
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced data classification is an important problem in the field of computer science. Traditional classification algorithms often experience a decrease in accuracy when the data distribution is uneven. Therefore, measures need to be taken to improve the balance of the dataset and enhance the classification accuracy of the model. We have designed a data resampling method to improve the accuracy of classification detection. This method relies on the negative selection process to constrain the data evolution process. By combining the CRITIC method with regression coefficients, we establish crossover selection probabilities for elite genes to achieve an evolutionary resampling process. Based on independent weights, the feature analysis improves by 3%. We evaluated the resampled results on publicly available datasets using traditional logistic regression with cross-validation. Compared to the other resampling models, the F1 score performance of the logistic regression five-fold cross-validation is more stable than the other methods using the two sampling results of the proposed method. The effectiveness of the proposed method is verified based on F1 score evaluation results.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Preprocessing method based on sample resampling for imbalanced data of electronic circuits
    Li R.
    Xu A.
    Sun W.
    Wu Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (11): : 2654 - 2660
  • [32] An improved nonlinear correlation method for feature selection of complex data
    Du Shang
    Ang Li
    Pengjian Shang
    Nonlinear Dynamics, 2023, 111 : 11357 - 11369
  • [33] An improved nonlinear correlation method for feature selection of complex data
    Shang, Du
    Li, Ang
    Shang, Pengjian
    NONLINEAR DYNAMICS, 2023, 111 (12) : 11357 - 11369
  • [34] Binary Differential Evolution based Feature Selection Method with Mutual Information for Imbalanced Classification Problems
    Ghosh, Arka
    Xue, Bing
    Zhang, Mengjie
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 794 - 801
  • [35] Selection-based resampling ensemble algorithm for nonstationary imbalanced stream data learning
    Ren, Siqi
    Zhu, Wen
    Liao, Bo
    Li, Zeng
    Wang, Peng
    Li, Keqin
    Chen, Min
    Li, Zejun
    KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 705 - 722
  • [36] Feature selection via minimizing global redundancy for imbalanced data
    Huang, Shuhao
    Chen, Hongmei
    Li, Tianrui
    Chen, Hao
    Luo, Chuan
    APPLIED INTELLIGENCE, 2022, 52 (08) : 8685 - 8707
  • [37] A hybrid stacking classifier with feature selection for handling imbalanced data
    Abraham A.
    Kayalvizhi R.
    Mohideen H.S.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 9103 - 9117
  • [38] Feature selection for imbalanced data based on neighborhood rough sets
    Chen, Hongmei
    Li, Tianrui
    Fan, Xin
    Luo, Chuan
    INFORMATION SCIENCES, 2019, 483 : 1 - 20
  • [39] GA-Based Feature Selection Method for Imbalanced Data with Application in Radio Signal Recognition
    Du, Limin
    Xu, Yang
    Liu, Jun
    Ma, Fangli
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2015, 8 : 39 - 47
  • [40] Smart Robust Feature Selection (SoFt) for imbalanced and heterogeneous data
    Kasim, Henry
    King, Stephen
    Lee, Gary Kee Khoon
    Sirigina, Rajendra Prasad
    How, Shannon Shi Qi
    Hung, Terence Gih Guang
    KNOWLEDGE-BASED SYSTEMS, 2022, 236