A novel synthetic minority oversampling technique based on relative and absolute densities for imbalanced classification

被引:10
|
作者
Liu, Ruijuan [1 ]
机构
[1] Chongqing Jianzhu Coll, Dept Publ Course, Chongqing 400072, Peoples R China
关键词
Class-imbalance learning; Class-imbalance classification; Oversampling; K nearest neighbors; Relative density; BORDERLINE-SMOTE; SAMPLING METHOD; ALGORITHM;
D O I
10.1007/s10489-022-03512-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a classifier from class-imbalance data is an important challenge. Among the existing solutions, SMOTE has received great praise and features an extensive range of practical applications. However, SMOTE and its extensions usually degrade due to noise generation and within-class imbalances. Although multiple variations of SMOTE are developed, few of them can solve the above problems at the same time. Besides, many improvements of SMOTE are based on advanced models with introducing external parameters. To solve imbalances between and within classes while overcoming noise generation, a novel synthetic minority oversampling technique based on relative and absolute densities is proposed. First, a novel noise filter based on relative density is proposed to remove noise and smooth class boundary. Second, sparsity and boundary weights are proposed and calculated by relative and absolute densities, respectively. Third, normalized weights based on absolute and sparse weights are proposed to generate more synthetic minority class samples in the class boundary and sparse regions. The main advantages of the proposed algorithm are that: (a) It can effectively avoid noise generation while removing noise and smoothing class the boundary in original data. (b) It generates more synthetic samples in class boundaries and sparse regions; (c) No additional parameters are introduced. Intensive experiments prove that SMOTE-RD outperforms 7 popular oversampling methods in average AUC, average F-measure and average G-mean on real data sets with the acceptable time cost.
引用
收藏
页码:786 / 803
页数:18
相关论文
共 50 条
  • [31] PSO-Based Synthetic Minority Oversampling Technique for Classification of Reduced Hyperspectral Image
    Subudhi, Subhashree
    Patro, Ram Narayan
    Biswal, Pradyut Kumar
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2017, VOL 1, 2019, 816 : 617 - 625
  • [32] Minority-prediction-probability-based oversampling technique for imbalanced learning
    Wei, Zhen
    Zhang, Li
    Zhao, Lei
    INFORMATION SCIENCES, 2023, 622 : 1273 - 1295
  • [33] A Novel Synthetic Minority Oversampling Technique for Multiclass Imbalance Problems
    Wang, Jiao
    Awang, Norhashidah
    IEEE ACCESS, 2025, 13 : 6054 - 6066
  • [34] Distributed Synthetic Minority Oversampling Technique
    Sakshi Hooda
    Suman Mann
    International Journal of Computational Intelligence Systems, 2019, 12 : 929 - 936
  • [35] Distributed Synthetic Minority Oversampling Technique
    Hooda, Sakshi
    Mann, Suman
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 929 - 936
  • [36] Perturbation-based oversampling technique for imbalanced classification problems
    Zhang, Jianjun
    Wang, Ting
    Ng, Wing W. Y.
    Pedrycz, Witold
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 773 - 787
  • [37] Perturbation-based oversampling technique for imbalanced classification problems
    Jianjun Zhang
    Ting Wang
    Wing W. Y. Ng
    Witold Pedrycz
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 773 - 787
  • [38] CL-SR: Boosting Imbalanced Image Classification with Contrastive Learning and Synthetic Minority Oversampling Technique Based on Rough Set Theory Integration
    Gao, Xiaoling
    Jamil, Nursuriati
    Ramli, Muhammad Izzad
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [39] SP-SMOTE: A novel space partitioning based synthetic minority oversampling technique
    Li, Yihong
    Wang, Yunpeng
    Li, Tao
    Li, Beibei
    Lan, Xiaolong
    KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [40] Hybrid oversampling technique for imbalanced pattern recognition: Enhancing performance with Borderline Synthetic Minority oversampling and Generative Adversarial Networks
    Ahsan, Md Manjurul
    Raman, Shivakumar
    Liu, Yingtao
    Siddique, Zahed
    Machine Learning with Applications, 2025, 20