The classification method based on evolutionary algorithm for high-dimensional imbalanced missing data

Cited by: 2

Authors:

Liu, Yi [1]

Li, Gengsong [2]

Li, Xiang [1]

Qin, Wei [1]

Zheng, Qibin [1]

Ren, Xiaoguang [1]

Affiliations:

[1] Acad Mil Sci, Beijing, Peoples R China

[2] Natl Innovat Inst Def Technol, Beijing, Peoples R China

Source:

ELECTRONICS LETTERS | 2023 / Vol. 59 / No. 12

Keywords:

artificial intelligence; particle swarm optimisation; SELECTION;

DOI:

10.1049/ell2.12842

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

High-dimensional imbalanced missing data classification is a challenging and complex problem that traditional algorithms struggle to solve effectively. To address this issue, a novel method is proposed, named the hybrid classification approach based on particle swarm optimization (HCPSO). HCPSO integrates ideas from feature selection, resampling, and imputation, and breaks each particle down into three parts. These parts represent the feature values, the probabilities of the resampling approaches, and the probabilities of the imputation approaches, respectively. HCPSO employs particle swarm optimization to optimize these parameters simultaneously, so that the three techniques are tuned jointly. Six types of algorithms, eleven datasets, and four performance indicators are used to evaluate the method. The results demonstrate a significant improvement in HCPSO's performance, with an average improvement of 13.02%, 18.95%, 20.25%, and 28.63% for accuracy, F1, AUC, and Gmean, respectively, compared to all other methods. Furthermore, the experiments also demonstrate the robustness of HCPSO.
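
The three-part particle described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the candidate lists `RESAMPLERS` and `IMPUTERS`, the 0.5 selection threshold, the argmax decoding, and the PSO coefficients `w`, `c1`, `c2` are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

# Hypothetical candidate methods; the abstract does not name the actual ones.
RESAMPLERS = ["none", "random_oversample", "random_undersample"]
IMPUTERS = ["mean", "median", "most_frequent"]


def decode_particle(position, n_features, threshold=0.5):
    """Split a flat position vector into the three parts named in the abstract:
    feature values, resampling probabilities, imputation probabilities."""
    position = np.asarray(position, dtype=float)
    n_res, n_imp = len(RESAMPLERS), len(IMPUTERS)
    if position.shape != (n_features + n_res + n_imp,):
        raise ValueError("unexpected particle length")
    feat = position[:n_features]
    res = position[n_features:n_features + n_res]
    imp = position[n_features + n_res:]
    mask = feat > threshold  # assumed rule: keep a feature if its value exceeds the threshold
    # Assumed rule: pick the method with the highest probability entry.
    return mask, RESAMPLERS[int(np.argmax(res))], IMPUTERS[int(np.argmax(imp))]


def pso_step(pos, vel, pbest, gbest, rng, w=0.7, c1=1.5, c2=1.5):
    """One standard PSO velocity/position update, with positions clipped to [0, 1]."""
    r1 = rng.random(pos.shape)
    r2 = rng.random(pos.shape)
    vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
    return np.clip(pos + vel, 0.0, 1.0), vel


# Usage: 4 feature entries, then 3 resampling entries, then 3 imputation entries.
particle = [0.9, 0.2, 0.7, 0.1, 0.1, 0.8, 0.3, 0.6, 0.2, 0.1]
mask, resampler, imputer = decode_particle(particle, n_features=4)
```

Because all three parts live in one position vector, a single PSO update moves the feature subset, the resampling choice, and the imputation choice together, which is one way to read the abstract's statement that the parameters are optimized simultaneously. A fitness function (for example, a cross-validated classifier score on the decoded pipeline) would be needed to drive the search and is left out here.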

Pages: 3
