Denying Evolution Resampling: An Improved Method for Feature Selection on Imbalanced Data

被引:1
|
作者
Quan, Li [1 ]
Gong, Tao [1 ]
Jiang, Kaida [1 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
基金
中国国家自然科学基金;
关键词
classification algorithms; imbalanced data; similarity measure; evolutionary process;
D O I
10.3390/electronics12153212
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced data classification is an important problem in the field of computer science. Traditional classification algorithms often experience a decrease in accuracy when the data distribution is uneven. Therefore, measures need to be taken to improve the balance of the dataset and enhance the classification accuracy of the model. We have designed a data resampling method to improve the accuracy of classification detection. This method relies on the negative selection process to constrain the data evolution process. By combining the CRITIC method with regression coefficients, we establish crossover selection probabilities for elite genes to achieve an evolutionary resampling process. Based on independent weights, the feature analysis improves by 3%. We evaluated the resampled results on publicly available datasets using traditional logistic regression with cross-validation. Compared to the other resampling models, the F1 score performance of the logistic regression five-fold cross-validation is more stable than the other methods using the two sampling results of the proposed method. The effectiveness of the proposed method is verified based on F1 score evaluation results.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] When is resampling beneficial for feature selection with imbalanced wide data?
    Ramos-Perez, Ismael
    Arnaiz-Gonzalez, Alvar
    Rodriguez, Juan J.
    Garcia-Osorio, Cesar
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 188
  • [2] An Embedded Feature Selection Method for Imbalanced Data Classification
    Liu, Haoyue
    Zhou, MengChu
    Liu, Qing
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (03) : 703 - 715
  • [3] An Embedded Feature Selection Method for Imbalanced Data Classification
    Haoyue Liu
    MengChu Zhou
    Qing Liu
    IEEE/CAAJournalofAutomaticaSinica, 2019, 6 (03) : 703 - 715
  • [4] A Classification Method Based on Feature Selection for Imbalanced Data
    Liu, Yi
    Wang, Yanzhen
    Ren, Xiaoguang
    Zhou, Hao
    Diao, Xingchun
    IEEE ACCESS, 2019, 7 : 81794 - 81807
  • [5] Feature Selection in Imbalanced Data
    Kamalov F.
    Thabtah F.
    Leung H.H.
    Annals of Data Science, 2023, 10 (06) : 1527 - 1541
  • [6] A feature selection method to handle imbalanced data in text classification
    Chang, Fengxiang
    Guo, Jun
    Xu, Weiran
    Yao, Kejun
    Journal of Digital Information Management, 2015, 13 (03): : 169 - 175
  • [7] A Novel Feature Selection Method in the Categorization of Imbalanced Textual Data
    Pouramini, Jafar
    Minaei-Bidgoli, Behrouze
    Esmaeili, Mahdi
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (08): : 3725 - 3748
  • [8] Evolutionary multistage multitasking method for feature selection in imbalanced data
    Ding, Weiping
    Yao, Hongcheng
    Huang, Jiashuang
    Hou, Tao
    Geng, Yu
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 92
  • [9] Weighted Gini Index Feature Selection Method for Imbalanced Data
    Liu, Haoyue
    Zhou, MengChu
    Lu, Xiaoyu Sean
    Yao, Cynthia
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC), 2018,
  • [10] Univariate feature selection on imbalanced data
    Chatterjee, Avishek
    Woodruff, Henry
    Lobbes, Marc
    Vallieres, Martin
    Seuntjens, Jan
    MEDICAL PHYSICS, 2019, 46 (11) : 5375 - 5375