Self-paced ensemble and big data identification: a classification of substantial imbalance computational analysis

被引:0
|
作者
Bano, Shahzadi [1 ]
Zhi, Weimei [1 ]
Qiu, Baozhi [1 ]
Raza, Muhammad [2 ]
Sehito, Nabila [3 ]
Kamal, Mian Muhammad [4 ]
Aldehim, Ghadah [5 ]
Alruwais, Nuha [6 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, 100 Sci Ave, Zhengzhou 450001, Peoples R China
[2] Xian Technol Univ, Xian, Peoples R China
[3] Zhengzhou Univ, Sch Elect Informat Engn, 100 Sci Ave, Zhengzhou 450001, Henan, Peoples R China
[4] Southeast Univ, Sch Elect Sci & Engn, Joint Int Res Lab Informat Display & Visualizat, Nanjing 210018, Peoples R China
[5] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, POB 84428, Riyadh 11671, Saudi Arabia
[6] King Saud Univ, Coll Appl Studies & Community Serv, Dept Comp Sci & Engn, POB 22459, Riyadh 11495, Saudi Arabia
来源
JOURNAL OF SUPERCOMPUTING | 2024年 / 80卷 / 07期
关键词
Self-paced ensemble; Big data; Classification; Computational; Simulation; Substantial imbalance;
D O I
10.1007/s11227-023-05828-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This research paper focuses on the challenges associated with learning classifiers from large-scale, highly imbalanced datasets prevalent in many real-world applications. Traditional algorithms learning often need better performance and high computational efficiency when dealing with imbalanced data. Factors such as class imbalance, noise, and class overlap make it demanding to learn effective classifiers. In this study, we propose a novel self-paced ensemble framework for classifying imbalanced data. The framework employs under-sampling to self-harmonize data hardness and build a robust ensemble. Extensive experimental testing demonstrates promising results in handling overlapping classes and skewed distributions while maintaining computational efficiency. The self-paced ensemble method addresses the challenges of high imbalance ratios, class overlap, and noise presence in large-scale imbalanced classification problems. By incorporating the knowledge of these challenges into our learning framework, we establish the concept of classification hardness distribution, and the self-paced ensemble is a revolutionary learning paradigm for massive imbalance categorization, capable of improving the performance of existing learning algorithms on imbalanced data and providing better results for future applications.
引用
收藏
页码:9848 / 9869
页数:22
相关论文
共 50 条
  • [31] Classification of multi-modal data in a self-paced binary BCI in freely moving animals
    Eliseyev, Andrey
    Faber, Jean
    Aksenova, Tatiana
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 7147 - 7150
  • [32] Self-paced ensemble for constructing an efficient robust high-performance classification model for detecting mineralization anomalies from geochemical exploration data
    Chen, Yongliang
    Du, Xudong
    Guo, Min
    ORE GEOLOGY REVIEWS, 2023, 157
  • [33] Deep self-paced learning for person re-identification
    Zhou, Sanping
    Wang, Jinjun
    Meng, Deyu
    Xin, Xiaomeng
    Li, Yubing
    Gong, Yihong
    Zheng, Nanning
    PATTERN RECOGNITION, 2018, 76 : 739 - 751
  • [34] SELF-PACED ENSEMBLE BASED MORTALITY PREDICTION AFTER ACUTE MYOCARDIAL INFARCTION
    Yan, Mingxuan
    Shen, Lan
    Sheng, Shuqian
    Miao, Yutong
    Lu, Yanqiao
    Gan, Xiaoying
    He, Ben
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2022, 79 (09) : 2035 - 2035
  • [35] CLASSIFICATION OF POLSAR IMAGES BASED ON SVM WITH SELF-PACED LEARNING OPTIMIZATION
    Chen, Wenshuai
    Hai, Dong
    Gou, Shuiping
    Jiao, Licheng
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 4491 - 4494
  • [36] Steel surface defects classification method based on self-paced learning
    Liang, Delong
    Chen, Dali
    Liu, Shixin
    Jia, Xu
    Zhao, Chunna
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7540 - 7545
  • [37] SPGLAD: A Self-paced Learning-Based Crowdsourcing Classification Model
    Zhang, Xianchao
    Shi, Heng
    Li, Yuangang
    Liang, Wenxin
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2017, 2017, 10526 : 189 - 201
  • [38] Adaptive Graph Learning for Semi-supervised Self-paced Classification
    Long Chen
    Jianbo Lu
    Neural Processing Letters, 2022, 54 : 2695 - 2716
  • [39] Single Cell Self-Paced Clustering with Transcriptome Sequencing Data
    Zhao, Peng
    Xu, Zenglin
    Chen, Junjie
    Ren, Yazhou
    King, Irwin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (07)
  • [40] DEEP SELF-PACED LEARNING FOR SEMI-SUPERVISED PERSON RE-IDENTIFICATION USING MULTI-VIEW SELF-PACED CLUSTERING
    Xin, Xiaomeng
    Wu, Xindi
    Wang, Yuechen
    Wang, Jinjun
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2631 - 2635