A No Parameter Synthetic Minority Oversampling Technique Based on Finch for Imbalanced Data

被引:1
|
作者
Xu, Shoukun [1 ]
Li, Zhibang [1 ]
Yuan, Baohua [1 ]
Yang, Gaochao [1 ]
Wang, Xueyuan [1 ]
Li, Ning [1 ]
机构
[1] Changzhou Univ, Coll Comp & Artificial Intelligence, Changzhou 213164, Jiangsu, Peoples R China
关键词
SMOTE; FINCH algorithm; Synthesis strategy; SAMPLING METHOD; SMOTE;
D O I
10.1007/978-981-99-4752-2_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The synthetic minority oversampling technique(SMOTE) has emerged as a significant approach to address class imbalance challenges in machine learning. However, the algorithm is afflicted by challenges such as the imbalanced distribution of minority class data and concerns regarding the quality of synthetic data. The enhanced variants combined with the clustering algorithm encounter the problems such as difficulty in determining the optimal value of hyperparameters and class overlap. So this paper proposes a new improved algorithm named NP-SMOTE. The core concept of the algorithm is as follows: initially, the FINCH algorithm is employed to cluster the minority class data into distinct clusters. Subsequently, the data within each cluster are categorized into boundary data and central data by determining the class of nearest neighbors for each minority class data. Finally, the appropriate synthesis methods are applied to generate data for these two classes of minority class data. This algorithm obviates the need for predetermined hyperparameters and circumvents the limitations of class overlap by synthesizing data from various classes in a customized manner. The algorithm exhibits robustness and superior generalizability as demonstrated by their comparison with commonly used algorithms across 6 datasets.
引用
收藏
页码:367 / 378
页数:12
相关论文
共 50 条
  • [21] Probability-Based Synthetic Minority Oversampling Technique
    Altwaijry, Najwa
    IEEE ACCESS, 2023, 11 : 28831 - 28839
  • [22] Minority-prediction-probability-based oversampling technique for imbalanced learning
    Wei, Zhen
    Zhang, Li
    Zhao, Lei
    INFORMATION SCIENCES, 2023, 622 : 1273 - 1295
  • [23] A Novel Adaptive Minority Oversampling Technique for Improved Classification in Data Imbalanced Scenarios
    Tripathi, Ayush
    Chakraborty, Rupayan
    Kopparapu, Sunil Kumar
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10650 - 10657
  • [24] CSMOUTE: Combined Synthetic Oversampling and Undersampling Technique for Imbalanced Data Classification
    Koziarski, Michal
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [25] Distributed Synthetic Minority Oversampling Technique
    Sakshi Hooda
    Suman Mann
    International Journal of Computational Intelligence Systems, 2019, 12 : 929 - 936
  • [26] Global Data Distribution Weighted Synthetic Oversampling Technique for Imbalanced Learning
    Wang, Zhenfei
    Wang, Hongju
    IEEE ACCESS, 2021, 9 : 44770 - 44783
  • [27] Distributed Synthetic Minority Oversampling Technique
    Hooda, Sakshi
    Mann, Suman
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 929 - 936
  • [28] Synthetic minority oversampling technique based on natural neighborhood graph with subgraph cores for class-imbalanced classification
    Zhao, Ming
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [29] Searching for Optimal Oversampling to Process Imbalanced Data: Generative Adversarial Networks and Synthetic Minority Over-Sampling Technique
    Eom, Gayeong
    Byeon, Haewon
    MATHEMATICS, 2023, 11 (16)
  • [30] SPAW-SMOTE: Space Partitioning Adaptive Weighted Synthetic Minority Oversampling Technique For Imbalanced Data Set Learning
    Zhang, Qiang
    He, Junjiang
    Li, Tao
    Lan, Xiaolong
    Fang, Wenbo
    Li, Yihong
    COMPUTER JOURNAL, 2023, 67 (05): : 1747 - 1762