Clustering Algorithms on Imbalanced Data Using the SMOTE Technique for Image Segmentation

被引:4
|
作者
Abeysinghe, Wajira [1 ]
Hung, Chih-Cheng [1 ,2 ]
Bechikh, Slim [3 ]
Wang, Xiaosong [1 ]
Rattani, Altaf [1 ]
机构
[1] Kennesaw State Univ, Marietta, GA 30060 USA
[2] Anyang Normal Univ, Anyang, Peoples R China
[3] Univ Tunis, SMART Lab, ISG Campus, Tunis, Tunisia
关键词
Imbalanced dataset; Image Segmentation; SMOTE; Oversampling; Undersampling;
D O I
10.1145/3264746.3264774
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Imbalanced data is a critical problem in machine learning. Most imbalanced dataset consists of one or more classes, called the minority class, which do not have enough number of samples for the recognition. Many traditional classification algorithms are unable to recognize the minority class effectively. Clustering algorithms used for image segmentation may have a high accuracy; however, none of samples in the minority class is classified correctly. In this study, we use three approaches, traditional oversampling technique, traditional undersampling technique, and the Synthetic Minority Over-sampling Technique (SMOTE), to reduce the significant difference of imbalance of the number of samples between the majority classes and the minority classes in the dataset. Fuzzy C-means algorithm (FCM) and Possibilistic Clustering Algorithm (PCA) are used to segment the images in which the samples are generated using above sampling methods. Experimental results are evaluated using the Kappa Coefficient and Confusion matrix. Our evaluation shows that the oversampling, undersampling, and SMOTE techniques can improve the imbalanced image segmentation problem with a higher accuracy([1]).
引用
收藏
页码:17 / 22
页数:6
相关论文
共 50 条
  • [1] Biomedical Image Data Segmentation with Using of Clustering Driven by Genetic Algorithms
    Kubicek, Jan
    Penhaker, Marek
    Augustynek, Martin
    Cerny, Martin
    Oczka, David
    [J]. NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 101 - 107
  • [2] FUZZY AND SMOTE RESAMPLING TECHNIQUE FOR IMBALANCED DATA SETS
    Zorkeflee, Maisarah
    Din, Aniza Mohamed
    Ku-Mahamud, Ku Ruhana
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS, 2015, : 638 - 643
  • [3] MCBC-SMOTE: A Majority Clustering Model for Classification of Imbalanced Data
    Arora, Jyoti
    Tushir, Meena
    Sharma, Keshav
    Mohan, Lalit
    Singh, Aman
    Alharbi, Abdullah
    Alosaimi, Wael
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 4801 - 4817
  • [4] MCBC-SMOTE: A Majority Clustering Model for Classification of Imbalanced Data
    Arora, Jyoti
    Tushir, Meena
    Sharma, Keshav
    Mohan, Lalit
    Singh, Aman
    Alharbi, Abdullah
    Alosaimi, Wael
    [J]. Computers, Materials and Continua, 2022, 73 (03): : 4801 - 4817
  • [5] FUSION OF IMAGE SEGMENTATION ALGORITHMS USING CONSENSUS CLUSTERING
    Ozay, Mete
    Vural, Fatos T. Yarman
    Kulkarni, Sanjeev R.
    Poor, H. Vincent
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 4049 - 4053
  • [6] Analysis Of Clustering Algorithms for MR Image Segmentation Using IQI
    Patel, S.
    Patnaik, K. S.
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 387 - 396
  • [7] Handling Autism Imbalanced Data using Synthetic Minority Over-Sampling Technique (SMOTE)
    El-Sayed, Asmaa Ahmed
    Meguid, Nagwa Abdel
    Mahmood, Mahmood Abdel Manem
    Hefny, Hesham Ahmed
    [J]. PROCEEDINGS OF 2015 THIRD IEEE WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2015,
  • [8] Imbalanced Data Classification using Random Subspace Method and SMOTE
    Huang, Hsiao-Yun
    Lin, Yi-Jhen
    Chen, Youg-Siang
    Lu, Hung-Yi
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 817 - 820
  • [9] DDSC-SMOTE: an imbalanced data oversampling algorithm based on data distribution and spectral clustering
    Li, Xinqi
    Liu, Qicheng
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (12): : 17760 - 17789
  • [10] Comparison of Image Segmentation Algorithms Based On Threshold Technique and Clustering Technique for CT Scan Images
    Manesh, M. Mahdian
    Faghihi, R.
    [J]. MEDICAL PHYSICS, 2019, 46 (06) : E566 - E566