Novel fuzzy clustering-based undersampling framework for class imbalance problem

Cited by: 2
Authors
Pratap, Vibha [1 ,2 ]
Singh, Amit Prakash [1 ]
Affiliations
[1] Guru Gobind Singh Indraprastha Univ, USICT, New Delhi, India
[2] Indira Gandhi Delhi Tech Univ Women, Delhi, India
Keywords
Class imbalance; Ensemble method; Fuzzy C-mean; Machine learning; Oversampling; Under-sampling; CLASSIFICATION; PREDICTION; SMOTE
DOI
10.1007/s13198-023-01897-1
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
The class imbalance problem occurs in many real-world datasets. Although it is often assumed that the samples of a dataset are evenly distributed across classes, in many cases datasets are highly imbalanced, and classifying such datasets is challenging in machine learning. Researchers have developed many approaches to the class imbalance problem, such as resampling and ensemble methods. Resampling methods either increase the number of minority class samples (oversampling) or reduce the number of majority class samples (undersampling). Ensemble methods, in contrast, classify various subsets of the data and combine the classification results to produce the final result. This study introduces a new fuzzy C-means clustering-based undersampling method. Experiments with the proposed method on 30 small-scale imbalanced datasets show that it improves classification performance: compared with the k-means undersampling method, average sensitivity improved by 1% and average balanced accuracy improved by 3%. The results of this study should be useful for classifying imbalanced datasets in various domains.
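The abstract does not describe the algorithm itself, so the following is only a minimal sketch of one common way to build a clustering-based undersampler on top of fuzzy C-means. It is not the authors' method. The function names `fuzzy_c_means` and `fcm_undersample`, the fuzzifier `m = 2`, the choice of as many clusters as there are minority samples, and the rule of keeping the highest-membership majority sample per cluster are all assumptions made for illustration.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Plain fuzzy C-means. Returns (centers, U), where U[i, j] is the
    membership of sample i in cluster j and every row of U sums to 1."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)  # random memberships, rows sum to 1
    for _ in range(n_iter):
        W = U ** m
        # Cluster centers are membership-weighted means of the samples.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Distance from every sample to every center, floored to avoid 1/0.
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U


def fcm_undersample(X, y, majority_label, seed=0):
    """Keep every minority sample and at most one majority sample per
    fuzzy cluster (assumed rule: the sample with the highest membership)."""
    maj_idx = np.flatnonzero(y == majority_label)
    min_idx = np.flatnonzero(y != majority_label)
    k = min(len(min_idx), len(maj_idx))  # assumed: one cluster per minority sample
    _, U = fuzzy_c_means(X[maj_idx], n_clusters=k, seed=seed)
    # argmax over samples picks one representative per cluster; duplicates
    # (one sample winning several clusters) are collapsed by np.unique.
    keep = np.unique(maj_idx[U.argmax(axis=0)])
    idx = np.concatenate([keep, min_idx])
    return X[idx], y[idx]
```

Calling `fcm_undersample(X, y, majority_label=0)` on a feature matrix `X` and label vector `y` returns a reduced dataset that a classifier can then be trained on. Picking real samples rather than cluster centers keeps the retained points inside the observed data, which is a design choice of this sketch rather than something stated in the abstract.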
Pages: 967-976
Page count: 10
Related papers
50 in total
  • [31] Fuzzy Clustering-Based Approach for Outlier Detection
    Al-Zoubi, Moh'd Belal
    Ali, Al-Dahoud
    Yahya, Abdelfatah A.
    RECENT ADVANCES AND APPLICATIONS OF COMPUTER ENGINEERING: PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE (ACE 10), 2010, : 192 - +
  • [32] Fuzzy clustering-based on aggregate attribute method
    Wang, Jia-Wen
    Cheng, Ching-Hsue
    ADVANCES IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4031 : 478 - 487
  • [33] Robust fuzzy clustering-based image segmentation
    Yang, Zhang
    Chung, Fu-Lai
    Wang, Shitong
    APPLIED SOFT COMPUTING, 2009, 9 (01) : 80 - 84
  • [34] A Novel Hybrid-Based Ensemble for Class Imbalance Problem
    Guo, Huaping
    Zhou, Jun
    Wu, Chang-an
    She, Wei
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (06)
  • [35] Class-overlap undersampling based on Schur decomposition for Class-imbalance problems
    Dai, Qi
    Liu, Jian-wei
    Shi, Yong-hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
  • [36] Ensemble learning via constraint projection and undersampling technique for class-imbalance problem
    Guo, Huaping
    Zhou, Jun
    Wu, Chang-An
    SOFT COMPUTING, 2020, 24 (07) : 4711 - 4727
  • [37] Ensemble learning via constraint projection and undersampling technique for class-imbalance problem
    Huaping Guo
    Jun Zhou
    Chang-an Wu
    Soft Computing, 2020, 24 : 4711 - 4727
  • [38] A Clustering-based Framework for Fast Training of Classifiers
    Sathyamoorthy, Sruthi
    Sivasankar, E.
    2020 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY (ICITIIT), 2020,
  • [39] A Clustering-based Framework for Classifying Data Streams
    Yan, Xuyang
    Homaifar, Abdollah
    Sarkar, Mrinmoy
    Girma, Abenezer
    Tunstel, Edward
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3257 - 3263
  • [40] Novel clustering-based pruning algorithms
    Paweł Zyblewski
    Michał Woźniak
    Pattern Analysis and Applications, 2020, 23 : 1049 - 1058