The classification method based on evolutionary algorithm for high-dimensional imbalanced missing data

Cited by: 2
Authors
Liu, Yi [1]
Li, Gengsong [2]
Li, Xiang [1]
Qin, Wei [1]
Zheng, Qibin [1]
Ren, Xiaoguang [1]
Affiliations
[1] Acad Mil Sci, Beijing, Peoples R China
[2] Natl Innovat Inst Def Technol, Beijing, Peoples R China
Keywords
artificial intelligence; particle swarm optimisation; SELECTION;
DOI
10.1049/ell2.12842
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract
Classifying high-dimensional, imbalanced data with missing values is a challenging problem that traditional algorithms struggle to solve effectively. To address it, a novel method is proposed: the hybrid classification approach based on particle swarm optimization (HCPSO). HCPSO integrates feature selection, resampling, and imputation by dividing each particle into three parts, which represent the feature values, the probabilities of the resampling approaches, and the probabilities of the imputation approaches, respectively. HCPSO then uses particle swarm optimization to optimize all three parts simultaneously, so that the three techniques are tuned jointly rather than in isolation. Six types of algorithms, eleven datasets, and four performance indicators are used to evaluate the method. Compared with all other methods, HCPSO improves accuracy, F1, AUC, and G-mean by an average of 13.02%, 18.95%, 20.25%, and 28.63%, respectively. The experiments also demonstrate the robustness of HCPSO.
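The three-part particle described in the abstract can be pictured with a small sketch. The abstract does not give the exact encoding, so everything below is an illustrative assumption rather than the authors' implementation: positions are kept in [0, 1], a feature is selected when its value exceeds a hypothetical threshold of 0.5, the resampling and imputation approaches with the highest score are chosen, and the update rule is the canonical PSO step. The names `decode_particle`, `pso_step`, and the approach labels are hypothetical.

```python
import random


def decode_particle(position, n_features, resamplers, imputers, threshold=0.5):
    """Split a flat particle into (selected features, resampler, imputer).

    Assumed layout: [feature values | resampling scores | imputation scores].
    Thresholding and arg-max selection are assumptions, not the paper's rule.
    """
    n_res, n_imp = len(resamplers), len(imputers)
    if len(position) != n_features + n_res + n_imp:
        raise ValueError("particle length does not match the three parts")

    feature_part = position[:n_features]
    resample_part = position[n_features:n_features + n_res]
    impute_part = position[n_features + n_res:]

    # Part 1: keep the features whose value exceeds the threshold.
    selected = [j for j, value in enumerate(feature_part) if value > threshold]
    # Parts 2 and 3: pick the approach with the highest score.
    resampler = resamplers[max(range(n_res), key=lambda k: resample_part[k])]
    imputer = imputers[max(range(n_imp), key=lambda k: impute_part[k])]
    return selected, resampler, imputer


def pso_step(position, velocity, personal_best, global_best, rng,
             w=0.7, c1=1.5, c2=1.5):
    """One canonical PSO update over all three parts at once.

    Positions are clipped to [0, 1]; w, c1, and c2 are assumed defaults.
    """
    new_position, new_velocity = [], []
    for x, v, p, g in zip(position, velocity, personal_best, global_best):
        v_next = w * v + c1 * rng.random() * (p - x) + c2 * rng.random() * (g - x)
        new_velocity.append(v_next)
        new_position.append(min(1.0, max(0.0, x + v_next)))
    return new_position, new_velocity


if __name__ == "__main__":
    rng = random.Random(0)
    resamplers = ["oversample", "undersample"]   # hypothetical labels
    imputers = ["mean", "knn"]                   # hypothetical labels
    particle = [0.9, 0.2, 0.7, 0.1, 0.8, 0.6, 0.3]
    print(decode_particle(particle, 3, resamplers, imputers))
    print(pso_step(particle, [0.0] * 7, particle, [0.5] * 7, rng))
```

In a full pipeline, the decoded triple would configure imputation, resampling, and feature selection before a classifier is trained, and a score such as G-mean on validation data would serve as the particle's fitness. That evaluation loop is omitted here because the abstract does not specify it.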
Pages: 3