The classification method based on evolutionary algorithm for high-dimensional imbalanced missing data

被引:2
|
作者
Liu, Yi [1 ]
Li, Gengsong [2 ]
Li, Xiang [1 ]
Qin, Wei [1 ]
Zheng, Qibin [1 ]
Ren, Xiaoguang [1 ]
机构
[1] Acad Mil Sci, Beijing, Peoples R China
[2] Natl Innovat Inst Def Technol, Beijing, Peoples R China
关键词
artificial intelligence; particle swarm optimisation; SELECTION;
D O I
10.1049/ell2.12842
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
High dimensional imbalanced missing data classification is a challenging and complex problem that traditional algorithms struggle to solve effectively. To address this issue, a novel method is proposed, named the hybrid classification approach based on particle swarm optimization (HCPSO). HCPSO integrates ideas of feature selection, resampling and imputation, and breaks particles down into three parts. These parts represent the feature values, probabilities of resampling, and probabilities of imputation approaches, respectively. Moreover, HCPSO employs particle swarm optimization to optimize these parameters simultaneously to take advantage of these methods. Six types of algorithms, eleven datasets, and four performance indicators are used to evaluate our method. The results demonstrate a significant improvement in HCPSO's performance, with an average improvement of 13.02%, 18.95%, 20.25%, and 28.63% for accuracy, F1, AUC, and Gmean, respectively, compared to all other methods. Furthermore, the experiments also demonstrate the robustness of HCPSO.
引用
下载
收藏
页数:3
相关论文
共 50 条
  • [41] Pattern Alternating Maximization Algorithm for Missing Data in High-Dimensional Problems
    Stadler, Nicolas
    Stekhoven, Daniel J.
    Buehlmann, Peter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 1903 - 1928
  • [42] Pattern alternating maximization algorithm for missing data in high-dimensional problems
    Städler, Nicolas
    Stekhoven, Daniel J.
    Bühlmann, Peter
    Journal of Machine Learning Research, 2014, 15 : 1903 - 1928
  • [43] Mortality prediction based on imbalanced high-dimensional ICU big data
    Liu, Jiankang
    Chen, Xian Xiang
    Fang, Lipeng
    Li, Jun Xia
    Yang, Ting
    Zhan, Qingyuan
    Tong, Kai
    Fang, Zhen
    COMPUTERS IN INDUSTRY, 2018, 98 : 218 - 225
  • [44] High-Dimensional Expensive Optimization by Classification-based Multiobjective Evolutionary Algorithm with Dimensionality Reduction
    Horaguchi, Yuma
    Nakata, Masaya
    2023 62ND ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS, SICE, 2023, : 1535 - 1542
  • [45] Improved Contraction-Expansion Subspace Ensemble for High-Dimensional Imbalanced Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5194 - 5205
  • [46] Quasi-Linear SVM with Local Offsets for High-dimensional Imbalanced Data Classification
    Yanze, Li
    Harutoshi, Ogai
    2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 882 - 887
  • [47] An Improved Ensemble Learning Method for Classifying High-Dimensional and Imbalanced Biomedicine Data
    Yu, Hualong
    Ni, Jun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (04) : 657 - 666
  • [48] Improving Evolutionary Algorithm Performance for Feature Selection in High-Dimensional Data
    Cilia, N.
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 439 - 454
  • [49] Class-imbalanced classifiers for high-dimensional data
    Lin, Wei-Jiun
    Chen, James J.
    BRIEFINGS IN BIOINFORMATICS, 2013, 14 (01) : 13 - 26
  • [50] SMOTE for high-dimensional class-imbalanced data
    Rok Blagus
    Lara Lusa
    BMC Bioinformatics, 14