Manifold neighboring envelope sample generation mechanism for imbalanced ensemble classification

被引:1
|
作者
Wang, Yiwen [1 ]
Li, Yongming [1 ]
Shen, Yinghua [2 ]
Li, Fan [3 ]
Wang, Pin [1 ]
机构
[1] Chongqing Univ, Sch Microelect & Commun Engn, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Econ & Business Adm, Chongqing 400044, Peoples R China
[3] Chongqing Jiaotong Univ, Sch Informat Sci & Engn, Chongqing 40044, Peoples R China
基金
中国国家自然科学基金;
关键词
Imbalanced classification problems; Imbalanced ensemble classification; Correlation information; Envelope sample; Fuzzy c -means clustering; Domain adaptation; PREDICTION;
D O I
10.1016/j.ins.2024.121103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For existing imbalanced ensemble (IE) methods, the sample subsets are constructed from the same dataset, which usually suffer from low quality (diversity and separability) of the subsets, so a manifold neighboring envelope sample generation mechanism (MNESG) and an imbalanced ensemble algorithm based on the mechanism (MNESG-IE) are proposed to solve this problem. First, for the original balanced subsets (OBS), a manifold neighboring sample envelope projection mechanism (MNSEP) is designed to mine the local correlation information between the samples and their nearest neighbors in the subsets. Second, the fuzzy c-means clustering (FCM) is used to further mine the global correlation information among similar samples in the subsets. Third, the sample distribution consistency preservation mechanism (SDCPM) is designed to enhance the consistency of the sample distribution before and after clustering. To better reduce the three accumulated losses above, the three steps are conducted simultaneously, thereby realizing the MNESG, which can transform the OBS into two new types of high quality envelope sample subsets - neighboring envelope sample (NES) subsets and neighboring cluster envelope sample (NCES) subsets. Finally, base classifiers are trained on the NES subsets and NCES subsets, and then fused by a two-dimensional sparse fusion mechanism (2D-SFM). Various representative IE algorithms on over thirty benchmark datasets are considered for verification. The results show that compared with the state-of-the-art IE algorithms, MNESG-IE achieves 17.79%, 17.90%, 23.61%, 18.08% improvement in terms of ACC, AUC, F-M and G-M, respectively. The major originality of the paper is: (a) proposing the MNSEP to mine the local correlation information for improving the quality of the subsets; (b) proposing the MNESG to generate high quality subsets by mining local and global correlation information simultaneously; and (c)forming an IE algorithm to better solve the imbalanced classification problem.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Deep Fuzzy Envelope Sample Generation Mechanism for Imbalanced Ensemble Classification
    Li, Fan
    Li, Yongming
    Shen, Yinghua
    Pedrycz, Witold
    Zhang, Xiaoheng
    Wang, Pin
    Li, Pufei
    Zhou, Chuanyan
    Cheng, Huan
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (03) : 1248 - 1262
  • [2] Hierarchical manifold sample envelope transformation model for ensemble classification
    Ma, Jie
    Chen, Hong
    Li, Yongming
    Wang, Pin
    Liu, Chengyu
    Shen, Yinghua
    Pedrycz, Witold
    Wang, Wei
    Li, Fan
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
  • [3] Stacked fuzzy envelope consistency imbalanced ensemble classification method
    Li, Fan
    Wang, Dan
    Li, Yongming
    Shen, Yinghua
    Pedrycz, Witold
    Wang, Pin
    Wang, Yiwen
    Zhang, Wenli
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265
  • [4] Virtual Sample Generation Approach for Imbalanced Classification
    Cao, Lu
    Shen, Hong
    2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 177 - 182
  • [5] Imbalanced Ensemble Algorithm Based on Envelope Learning and Hierarchical Structure Consistency Mechanism
    Li F.
    Zhang X.-H.
    Li Y.-M.
    Wang P.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (03): : 751 - 761
  • [6] Manifold Nearest Neighbor Sample Envelope and Hierarchical Multitype Transform Algorithm for Ensemble Learning
    Yan, Fang
    Ma, Jie
    Li, Yong-Ming
    Wang, Pin
    Qin, Jian
    Liu, Cheng-Yu
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (12): : 4125 - 4141
  • [7] A synthetic neighborhood generation based ensemble learning for the imbalanced data classification
    Chen, Zhi
    Lin, Tao
    Xia, Xin
    Xu, Hongyan
    Ding, Sha
    APPLIED INTELLIGENCE, 2018, 48 (08) : 2441 - 2457
  • [8] A synthetic neighborhood generation based ensemble learning for the imbalanced data classification
    Zhi Chen
    Tao Lin
    Xin Xia
    Hongyan Xu
    Sha Ding
    Applied Intelligence, 2018, 48 : 2441 - 2457
  • [9] Imbalanced data classification based on diverse sample generation and classifier fusion
    Junhai Zhai
    Jiaxing Qi
    Sufang Zhang
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 735 - 750
  • [10] Imbalanced data classification based on diverse sample generation and classifier fusion
    Zhai, Junhai
    Qi, Jiaxing
    Zhang, Sufang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (03) : 735 - 750